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METHODS FOR DIAGNOSING, PREVENTING, AND TREATING 
DEVELOPMENTAL DISORDERS DUE TO A COMBINATION OF 
GENETIC AND ENVIRONMENTAL FACTORS 

FIELD OF THE INVENTION 

5 The invention relates generally to novel methods of diagnosing, preventing, and 
treating specific diseases which are caused by a combination of genetic and 
environmental factors. One such disease exemplified is schizophrenia. 

BACKGROUND OF THE INVENTION 

The term "schizophrenia" was introduced by Bleuler in the beginning of this century 
1 0 to encompass a dissociation or disruption of thought processes, along with a 

dichotomy among thought, emotion, and behavior [Bleuler, Translation J. Zinkin, 
New York: International University Press (1950)]. The current definition of 
schizophrenia includes a break with reality that is usually manifested as 
hallucinations, delusions, or disruption in thought processes [Carpenter et al t Medical 
1 5 Progress, 330:681-690 (1994)]. At present the nationally accepted definition for the 
diagnosis of schizophrenia is contained in Diagnostic and Statistical Manual for 
Mental Disorders, Fourth Edition, Washington, D.C (1994): American Psychiatric 
Association, hereby incorporated by reference in its entirety. 

Schizophrenia is a clinical syndrome that has a profound influence on public health. 

20 The symptoms for schizophrenia begin early in life, and continues for most patients 
throughout their lives. An estimate of the direct and indirect costs of schizophrenia 
was thirty-three billion dollars for 1990 in the United States alone [Carpenter et al, 
1994, supra]. Indeed, one of every forty dollars spent for total heath care 
expenditures in the United States is spent on treating schizophrenia [Rupp et al f 

IS Psychiatric Clin. North Am., 16:413-423 (1993)]. Furthermore, estimates have been 
made suggesting that up to 50% of the homeless American population is 
schizophrenic [Bachrach, In: Treating the Homeless Mentally III Washington, D.C, 
American Psychiatric Press, 1 3-40, Lamb et al ed. (1 992)]. 
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The genetic factors in schizophrenia, though clearly documented to be present, are not 
simple [Carpenter and Buchanan, N. Engl. J. Med., 330:681-689 (1994); Gottesman, 
Clin. Genet., 46:1 16-123 (1994)]. Schizophrenia is, at least in part, a 
neurodevelopmental disorder, a birth defect in which the brain has been subtly 
5 damaged during development [Carpenter and Buchanan, N. Engl. J. Med., 330:681- 
689 (1994); Weinberger, Arch. Gen. Psychiatry, 44:660-669 (1987); Brixey et al, J. 
Clin. Psychol, 49:447-456 (1993)]. Evidence of this damage is seen both at autopsy 
[Kovelman and Scheibel, Biol. Psychiatry, 19:1601-1621 (1984); Bogerts et al, Arch. 
Gen. Psychiatry, 42:784-791 (1985); Jakob and Beckman, J. Neural Transm., 
10 65:303-326 (1986); Brown et al, Arch. Gen. Psychiatry, 43:36-42 (1986); Benes and 
Bird, Arch Gen Psychiatry, 44:608-616 (1987); Colter et al, Arch Gen Psychiatry, 
44:1023 (1987); Altshuler et al., Arch. Gen. Psychiatry, 47:1029-1034 (1990); 
Pakkenberg, Schizophr. Res., 7:95-100 (1992); Bogerts, Schizophr. Bull., 19:431-445 
(1993); Shapiro, Schizophr. Res., 10:187-239 (1993)] and by neuroimaging [Jeste et 
15 al., Br. J. Psychiatry, 153:444-459 (1988); Suddath et al, Am. J. Psychiatry, 

146:464-472 (1989); Suddath et al., N. Engl. J. Med., 322:789-794 (1990); DeLisi et 
al, Biol Psychiatry, 29:159-175 (1991); Breier et al, Arch. Gen. Psychiatry, 49:921- 
926 (1992); O'Callaghan et al, J. R. Soc. Med, 85:227-231 (1992); Bogerts et al, 
Biol Psychiatry, 33:236-246 (1993); Andreasen et al, Science, 266:294-298 (1994)]. 
20 The pattern of this brain damage and the presence of minor congenital abnormalities 
point to an insult occurring during the second trimester of fetal development [Bracha 
et al, Biol Psychiatry, 30:719-725 (1991); Bracha et al, Am. J. Psychiatry, 
149:1355-1361 (1992); Green etal, Psychiatry Res., 53:119-127 (1994)]. 
Epidemiological studies have documented a season-of-birth effect by which 
25 schizophrenics are more frequently born during winter and early spring than during 
other seasons [Boyd et al, Schizophr. Bull, 12:173-186 (1986); Kendell and Adams, 
Br. J. Psychiatry, 158:758-763 (1991); O'Callaghan et al, Br. J. Psychiatry, 
158:764-769 (1991)]. Also, individuals exposed to an influenza epidemic [Mednick 
et al, Arch. Gen. Psychiatry, 45:189-192 (1988); Barr et al.,Arch. Gen. Psychiatry, 
30 47:869-874 (1990); O'Callaghan et al, Lancet., 337:1248-1250 (1991); Murray et 
al, J. Psychiatr. Res., 26:225-235 (1992); Adams et al, Br. J. Psychiatry, 163:522- 
534 (1993)] or famine [Susser and Lin, Arch. Gen. Psychiatry, 49:983-988 (1992)] 
during their second trimester of fetal development have increased risk of later 
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developing schizophrenia, according to some studies but not others [Kendell, Arch. 
Gen. Psychiatry, 46:878-882 (1989); Crow and Done, Br. J. Psychiatry, 161:390-393 
(1992)]. This has suggested that an environmental effect such as dietary deficiency, 
virus infection [Kirch, Schizophr. Bull., 19:355-370 (1993)], vitamin deficiency, or 
effect of cold weather may be acting during fetal development. 

Linkage mapping studies in schizophrenia have been difficult. Recently, some 
studies [Straub et al, Nature Genet, 11:287-293 (1995); Schwab et al, Nature 
Genet, 11:325-327 (1995); Moises et al. Nature Genet, 11:321-324 (1995)] have 
supported a gene locus on chromosome 6 (6p24-22, near the HLA region) as having 
an effect in schizophrenia; other studies gave little or no support to a marker in this 
region [Wang et al, Nature Genet, 10:41-46 (1995); Mowry et al, Nature Genet, 
11:233-234 (1995); Gurling et al, Nature Genet, 11:234-235 (1995); Antonarakis et 
al, Nature Genet, 11:235-236 (1995)]. At best this locus appeared to be involved in 
only about 15-30% of families [Straub et al, 1995, supra]. Also, some evidence for 
loci on chromosomes 3 [Pulver et al, Am. J. Med. Genet, 60:252-260 (1995), 8 
[Pulver et al, Am. J. Med. Genet, 60:252-260 (1995); Kendler et al, Am. J. Psych. 
153:1534-1540 (1996), 9 [Coon et al, Biol Psychiatry, 34:277-289 (1993); Moises et 
al, Nature Genet, 11:321-324 (1995)] and 22 [Coon et al, Am. J. Med. Genet, 
54:72-79 (1994); Pulver et al, Am. J. Med. Genet, 54:3-43 (1994)]have been 
reported. In addition, two polymorphic markers very close to the gene encoding 
dihydrofolate reductase (DHFR) on chromosome 5q, D5S76 and D5S39, gave very 
high lod scores (as high as 6.49, i.e. odds of about 3 million to one in favor of genetic 
linkage versus chance occurrence) in 7 British and Icelandic schizophrenia families 
studied [Schwab et al, Nat Genet. 11:325-327 (1997); Straub et al, Molec 
Psychiatr. 2:148-155 (1997)]. However, this result could not be confirmed in studies 
of numerous other families. 

There could be several reasons for this difficulty. First, there may be more than one 
gene involved, (locus heterogeneity). Second, the genetic factor(s) may be common 
in the population (high disease allele frequency), thus diminishing the power of 
linkage studies [Terwilliger and Ott, Handbook of Human Genetic Linkage, 
Baltimore: Johns Hopkins Univ. Pr., 181 (1994)]. Third, the correct genetic model 
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may be unknown [Owen, Psychol. Med., 22:289-293 (1992)]. Any or all of these 
factors could diminish the power of a linkage study sufficiently to make success very 
difficult [Terwilliger and Ott, 1 994, supra] . 

Thus the current (developmental) model for schizophrenia is that genetic and 
5 environmental factors cause brain damage in a fetus that later develops schizophrenia. 
However, the genetic and environmental factors have not been identified. Also, 
extensive linkage and association studies have failed to identify genes determining 
schizophrenia. 

Indeed, schizophrenia appears to be just one of a family of developmental disorders 
10 whose cause has not been identified. Other such developmental disorders are defined 
by the Diagnostic and Statistical Manual for Mental Disorders, Fourth Edition, 
Washington, D.C (1994) and include: Tourette Syndrome which is identical to 
Tourette's Disorder and is a subcategory of Tic Disorders; Bipolar Disorder which is 
identical with Bipolar I Disorder or Bipolar II disorder; Autism which is identical 
15 with Autistic Disorder which is a subcategory of Pervasive Developmental Disorders; 
Conduct disorder which is a subcategory of Attention-Deficit and Disruptive 
Behavioral Disorders; Attention-Deficit Hyperactivity Disorder which is identical to 
Attention-Deficit/Hyperactivity Disorder and to Attention-Deficit/Hyperactivity 
Disorder NOS (not otherwise specified) which is also a subcategory of Attention- 
20 Deficit and Disruptive Behavioral Disorders; Obsessive-Compulsive Disorder which 
is a subtype of Anxiety Disorders; Chronic Multiple Tics Syndrome which is identical 
to Chronic Motor or Vocal Tic Disorder which is a subtype of Tic Disorders; and 
Learning Disorders. 

In addition Spina bifida is a developmental disorder. Spina bifida is a form of neural 
25 tube defect in which neural elements (spinal nerves or spinal chord) or coverings of 
the brain and spinal chord (dura mater, arachnoid mater) herniate through a midline 
defect into a cystic cavity covered completely or partially by skin. 

Therefore, there is a need for new methods of diagnosing individuals susceptible to 
developing a developmental disorder. In addition, there is a need for methods of 
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identifying individuals susceptible to having offspring that develop a developmental 
disorder. Finally, there is a need for a method of treating such susceptible individuals 
in order to prevent and/or ameliorate the symptoms due to and/or associated with the 
developmental disorder. 

5 The citations of any reference herein should not be construed as an admission that 
such reference is available as "Prior Art" to the instant application. 

SUMMARY OF THE INVENTION 

The present invention provides methods of diagnosing, preventing and/or treating 
specific developmental disorders. Towards this end the present invention provides 

10 methods of identifying an individual as being genetically or environmentally 

susceptible for developing or having a developmental disorder or for having offspring 
that develop the developmental disorder. Such a developmental disorder can be 
schizophrenia, spina bifida cystica, Tourette's syndrome, bipolar illness, autism, 
conduct disorders, attention deficit disorder, obsessive compulsive disorder, chronic 

1 5 multiple tic syndrome and learning disorders such as dyslexia. In addition, any of the 
methods provided herein for identifying an individual as being genetically and/or 
environmentally susceptible for having or developing a developmental disorder or for 
having offspring that develop the developmental disorder can also be used in 
diagnosing the individual, preferably in conjunction with a clinical diagnosis. 
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due to both environmental and genetic factors. The present invention further provides 
methods of identifying an individual as being susceptible for having offspring that are 
susceptible for developing a developmental disorder 

The present invention therefore provides methods for compiling genetic reference 
5 datasets, environmental reference datasets and/or genetic and environmental reference 
datasets for use in determining a predicted probability for an individual of having a 
susceptibility for having or developing a developmental disorder, or for having 
offspring that develop a developmental disorder. 

In one aspect of the invention, the present invention provides methods that comprise 
1 0 generating a genetic reference dataset for use in determining the predicted probability 
of an individual for having a susceptibility for having or developing a developmental 
disorder due to genetic factors, or for having offspring that develop a developmental 
disorder due to genetic factors. 

One such embodiment comprises collecting a biological sample from a human 
15 subject. The human subject can be a diagnostic proband, a blood relative of the 
diagnostic proband, an affected proband, a blood relative of the affected proband, a 
control proband, and/or a blood relative of the control proband. The biological 
sample contains nucleic acids and/or proteins from the human subject. The nucleic 
acids and/or proteins from the biological sample are then analyzed resulting in a 
20 partial or full genotype for the alleles of the genes involved in folate, pyridoxine, 
and/or cobalamin metabolism. The partial or full genotype then forms a dataset of 
genetic explanatory variables for the human subject. The dataset of genetic 
explanatory variables is then compiled from multiple human subjects into a genetic 
reference dataset. Such compilations are exemplified in the Detailed Description and 
25 Examples below. 

In another aspect, the present invention provides a method that comprises generating 
a genetic and environmental reference dataset for use in determining the predicted 
probability of an individual for having a susceptibility for having or developing a 
developmental disorder due to genetic factors and environmental factors, or for 
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having offspring that develop a developmental disorder due to genetic factors and 
environmental factors. One such embodiment comprises obtaining dietary and 
epidemiological information for environmental explanatory variables for the human 
subjects and combining the environmental explanatory variables with a genetic 
5 reference dataset for the human subjects as described above. 

In another aspect, the present invention provides an environmental reference dataset 
for use in the determination of the predicted probability for an individual for having a 
susceptibility for having or developing a developmental disorder due to 
environmental factors, or for having offspring that develop a developmental disorder 

1 0 due to environmental factors One such embodiment comprises obtaining dietary and 
epidemiological information for environmental explanatory variables for a human 
subject. The human subject can be a diagnostic proband, a blood relative of the 
diagnostic proband, an affected proband, a blood relative of the affected proband, a 
control proband, or a blood relative of the control proband. The dataset of 

1 5 environmental explanatory variables is then compiled from multiple human subjects 
into an environmental reference dataset for the human subjects. 

The developmental disorder forming the basis of the reference datasets of the present 
invention can be schizophrenia, or spina bifida cystica, or Tourette's syndrome, or 
dyslexia, or conduct disorder, or attention-deficit hyperactivity disorder, or bipolar 

20 illness, or autism, or chronic multiple tic syndrome or obsessive-compulsive disorder, 
or like disorders. A blood relative is preferably the mother of the individual, a 
sibling, the father or a grandparent of the individual. When the reference dataset is 
for use in the determination of the predicted probability for an individual of having a 
susceptibility for having offspring that develop a developmental disorder, the 

25 individual is preferably a pregnant woman. The reference datasets of the present 
invention are themselves part of the present invention. 

The present invention further provides methods of estimating the genetic 
susceptibility of an individual to have or to develop a developmental disorder, or to 
have offspring that develop a developmental disorder. In one such embodiment the 
30 method comprises collecting a biological sample from a participant (or participants) 
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who is either the individual or a blood relative of the individual. The biological 
sample contains nucleic acids and/or proteins of the participant. The analysis of the 
nucleic acids and/or proteins from the biological sample yield a partial or full 
genotype for the alleles of the genes involved in folate, pyridoxine, and/or cobalamin 
metabolism. The partial or full genotype forms a dataset of genetic explanatory 
variables for the participants. The dataset of genetic explanatory variables obtained 
are added to a genetic reference dataset forming a combined genetic dataset. A model 
is then formulated comprising the genetic explanatory variables obtained from the 
participants and the combined genetic dataset is analyzed. A predicted probability for 
the individual for having and/or developing a developmental disorder and/or having 
offspring that develop a developmental disorder is then determined. The genetic 
susceptibility of an individual to have or to develop a developmental disorder and/or 
have offspring that develop a developmental disorder is estimated. In a preferred 
embodiment, analyzing the combined genetic dataset is performed by binary linear 
regression. In a more preferred embodiment, the binary linear regression is 
performed with the SAS system. In another preferred embodiment, the model is 
modified by adding or subtracting one or more genetic explanatory variables and the 
combined genetic dataset is re-ana]yzed, preferably by binary logistic regression. In 
this case a model is chosen that best fits the data. This can be accomplished by 
testing the model for goodness of fit. 

The present invention also provides methods of estimating the genetic and 
environmental susceptibility of an individual to have or to develop a developmental 
disorder and/or for having offspring that develop a developmental disorder. One such 
embodiment comprises collecting a biological sample from one or more participants. 
Again, the participant is either the individual or a blood relative of the individual. 
The biological sample contains nucleic acids and/or proteins of the participant. The 
nucleic acids and/or proteins from the biological sample are analyzed resulting in a 
partial or full genotype for the alleles of the genes involved in folate, pyridoxine, 
and/or cobalamin metabolism. The partial or full genotype forms a dataset of genetic 
explanatory variables for the participant. Dietary and epidemiological information 
for environmental explanatory variables for the participant(s) are also obtained which 
are used to form a dataset of environmental explanatory variables for the 
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participant(s). The datasets of genetic explanatory variables and the dataset of 
environmental explanatory variables are added to a genetic and environmental 
reference dataset forming a combined genetic and environmental dataset. A model is 
formulated comprising the genetic and environmental explanatory variables obtained 

5 from the participant(s). The combined genetic and environmental dataset is then 
analyzed and a predicted probability for the individual for having and/or developing a 
developmental disorder and/or for having offspring that develop a developmental 
disorder is determined. The genetic and environmental susceptibility of an individual 
to have or to develop a developmental disorder and/or have offspring that develop a 

10 developmental disorder is estimated. In a preferred embodiment, analyzing the 

combined genetic and environmental dataset is performed by binary linear regression. 
In a more preferred embodiment the binary linear regression is performed with the 
SAS system. In another preferred embodiment the model is modified by adding or 
subtracting one or more genetic and/or environmental explanatory variables and the 

1 5 combined genetic and environmental dataset is re-analyzed preferably, by binary 
logistic regression. In this case a model is chosen that best fits the data. This can be 
accomplished by testing the model for goodness of fit. 

For any of these methods, the developmental disorder can be schizophrenia, spina 
bifida cystica, Tourette's syndrome, bipolar illness, autism, conduct disorder, 
20 attention deficit hyperactivity disorder, obsessive compulsive disorder, chronic 
multiple tic syndrome and learning disorders such as dyslexia. 

In a particular embodiment, the individual is suspected of being genetically 
susceptible of having or for developing the developmental disorder and/or of being 
genetically susceptible of having offspring that develop the developmental disorder. 

25 In a preferred embodiment of this type, the individual is suspected of being 
genetically susceptible for having or for developing the developmental disorder 
and/or of being genetically susceptible of having offspring that develop the 
developmental disorder because a blood relative has the developmental disorder. In 
one such embodiment the blood relative is a parent, a sibling, or a grandparent. In a 

30 preferred embodiment the blood relative is the mother of the individual. In a 
particular embodiment in which the individual is suspected of being genetically 
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participant s). The datasets of genetic explanatory variables and the dataset of 
environmental explanatory variables are added to a genetic and environmental 
reference dataset forming a combined genetic and environmental dataset. A model is 
formulated comprising the genetic and environmental explanatory variables obtained 
> from the participant(s). The combined genetic and environmental dataset is then 
analyzed and a predicted probability for the individual for having and/or developing a 
developmental disorder and/or for having offspring that develop a developmental 
disorder is determined. The genetic and environmental susceptibility of an individual 
to have or to develop a developmental disorder and/or have offspring that develop a 
0 developmental disorder is estimated. In a preferred embodiment, analyzing the 

combined genetic and environmental dataset is performed by binary linear regression. 
In a more preferred embodiment the binary linear regression is performed with the 
SAS system. In another preferred embodiment the model is modified by adding or 
subtracting one or more genetic and/or environmental explanatory variables and the 
5 combined genetic and environmental dataset is re-analyzed preferably, by binary 
logistic regression. In this case a model is chosen that best fits the data. This can be 
accomplished by testing the model for goodness of fit. 

For any of these methods, the developmental disorder can be schizophrenia, spina 
bifida cystica, Tourette's syndrome, bipolar illness, autism, conduct disorder, 
20 attention deficit hyperactivity disorder, obsessive compulsive disorder, chronic 
multiple tic syndrome and learning disorders such as dyslexia. 

In a particular embodiment, the individual is suspected of being genetically 
susceptible of having or for developing the developmental disorder and/or of being 
genetically susceptible of having offspring that develop the developmental disorder. 
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30 preferred embodiment the blood relative is the mother of the individual. In a 
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susceptible of having offspring that develop the developmental disorder, the 
individual is a pregnant woman. In another such embodiment the individual is the 
mate of the pregnant woman. In a particular embodiment exemplified below, the 
developmental disorder is schizophrenia. 

5 Since the availability of the data regarding the genetic and environmental explanatory 
factors can vary in separate determinations, variations in the explanatory factors used 
is clearly envisioned by the present invention. 



The present invention further provides methods of lowering the risk of a pregnant 
woman to have a child that will develop a developmental disorder. One such 

1 0 embodiment comprises administering methylfolate, cobalamin or pyridoxine to the 
pregnant woman and/or fetus, which lowers the risk of the pregnant woman to give 
birth to a child with a developmental disorder. In a particular embodiment of this 
type, the pregnant woman had been previously determined to be susceptible of having 
offspring that develop a developmental disorder by a method disclosed herein. The 

1 5 present invention further provides a method of determining if any treatment is 

advisable for a pregnant woman that is genetically susceptible to having offspring that 
develop a developmental disorder which comprises determining the concentration of 
a risk factor from a tissue sample or body fluid from the pregnant woman. When the 
concentration of the risk factor is statistically above or below an accepted normal 

20 range, treatment is advisable. 

The present invention further provides methods of determining if any treatment is 
advisable for a pregnant woman who has been determined to be susceptible to having 
offspring that develop a developmental disorder. One such embodiment comprises 
determining the concentration of a risk factor from a tissue sample or body fluid from 
25 the pregnant woman. When the concentration of the risk factor is statistically above 
or below an accepted normal range, treatment is advisable. In a particular 
embodiment of this type, the pregnant woman had been previously determined to be 
susceptible of having offspring that develop a developmental disorder by a method 
disclosed herein. 
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Methods of monitoring the effect of the administration of methylfolate, cobalamin or 
pyridoxine to the pregnant woman who has been determined to be susceptible to 
having offspring that develop a developmental disorder are also included in the 
present invention. One such embodiment comprises determining the concentration of 

5 a risk factor from a tissue sample or body fluid from the pregnant woman. When the 
concentration of the risk factor is statistically within an accepted normal range, the 
treatment is deemed effective. In a particular embodiment of this type, the pregnant 
woman had been previously determined to be susceptible of having offspring that 
develop a developmental disorder by a method disclosed herein. The risk factor can 

10 be any substance and/or metabolite linked to folate and/or cobalamin and/or 

pyridoxine metabolism. In one embodiment, the risk factor is homocysteine. In yet 
another embodiment, the risk factor is folate. In still another embodiment, the risk 
factor is cobalamin. 

The present invention also provides a method of treating an asymptomatic individual 
1 5 determined to be susceptible for developing a developmental disorder comprising 

administering methylfolate, cobalamin and/or pyridoxine. In a particular embodiment 
of this type, the asymptomatic individual had been previously determined to be 
susceptible of developing a developmental disorder by a method disclosed herein. 

The DNA samples from the persons tested may be obtained from any source 
20 including blood, a tissue sample, amniotic fluid, a chorionic villus sampling, 
cerebrospinal fluid, and urine. 

The present invention includes but is not limited to the examples of proteins encoded 
by genes involved in folate, cobalamin and pyridoxine metabolism compiled in 
Tables 2-7 in the Detailed Description of the Invention, below. For certain genes 

25 nucleic acid and/or amino acid sequence data is also provided. These genes and 

related sequence data are solely intended as examples of genes that are suitable to be 
used in the methods described herein. Such sequence data can be used for carrying 
out the genetic analysis of the present invention. However, the present invention is 
not intended to be limited in any way to such lists of proteins or the related sequence 

30 data. 
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It is further contemplated by the present invention to provide methods that include the 
testing for a genetic mutations in individual genes involved in folate and cobalamin 
metabolism and/or in individual combinations of such genes (e.g., 
methylenetetrahydrofolate reductase gene and methionine synthase). In addition, all 
possible combinatorials, and permutations of such genes including a constellation 
comprising all of the genes involved in folate, pyridoxine, and cobalamin metabolism 
is envisioned by the present invention. Alternatively, a constellation of genes in 
which any one or more genes can be excluded from those tested is also contemplated 
by the present invention (for example, a given constellation of genes can include 
genes encoding all of the proteins in Table 2 and 4 except the folate receptor 2-like 
protein). Thus all of such possible constellations are envisioned by, and are therefore 
part of the present invention. 

The present invention also provides DNA polymorphisms that can be used as genetic 
explanatory factors in the present invention. One such embodiment is a nucleic acid 
encoding a genetic variant of human dihydrofolate reductase comprising a nucleotide 
sequence having a 19 base-pair deletion spanning nucleotides 540 to 558 of the 
nucleotide sequence of SEQ ID NO:41. In a preferred embodiment the nucleic acid 
has the nucleotide sequence of SEQ ID NO:42. 

The present invention also includes primers. One such embodiment is a PCR primer 
that can be used to distinguish SEQ ID NO:42 from SEQ ID NO:41 . Another 
embodiment is a PCR primer that can be used to distinguish SEQ ID NO:42 from 
SEQ ID NO:45. These primers are useful for identifying the 19 base-pair deletion 
spanning nucleotides 540 to 558 of the nucleotide sequence of SEQ ID NO:41 (see 
Example 2). In a particular embodiment, the PCR primer comprises 8 to 100 and 
preferably 10 to 50 consecutive nucleotides from the nucleotide sequence of SEQ ED 
NO:41 . In another embodiment the PCR primer comprises 8 to 100 and preferably 1 0 
to 50 consecutive nucleotides from the nucleotide sequence of the complementary 
strand of SEQ ID NO:41 . In still another embodiment the PCR primer comprises 8 to 
100 and preferably 10 to 50 consecutive nucleotides from the nucleotide sequence of 
SEQ ID NO:42. In yet another embodiment the PCR primer comprises 8 to 100 and 
preferably 10 to 50 consecutive nucleotides from the nucleotide sequence of the 
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complementary strand of SEQ ID NO:42. In still another embodiment the PCR 
primer comprises 8 to 100 and preferably 10 to 50 consecutive nucleotides from the 
nucleotide sequence of SEQ ID N0.45. In yet another embodiment the PCR primer 
comprises 8 to 100 and preferably 10 to 50 consecutive nucleotides from the 
5 nucleotide sequence of the complementary strand of SEQ ID NO:45. 

In a particular embodiment the PCR primer comprises 8 to 100 and preferably 10 to 
50 consecutive nucleotides from nucleotides 350 to 530 of SEQ ID NO:41. In a 
preferred embodiment of this type, the PCR primer has the nucleotide sequence of 
CTAAACTGCATCGTCGCTGTG (SEQ ID NO:38). In another particular 

1 0 embodiment the PCR primer comprises 8 to 1 00 and preferably 1 0 to 50 consecutive 
nucleotides from the complementary strand of nucleotides 550 to 850 of SEQ ID 
NO:41 . In preferred embodiment of this type, the PCR primer comprises 8 to 100 
and preferably 10 to 50 consecutive nucleotides from the complementary strand of 
nucleotides 570 to 690 of SEQ ID NO:41. In a particular embodiment, the PCR 

1 5 primer has the nucleotide sequence of AAAAGGGG AATCC AGTCGG (SEQ ID 
NO:39). 

The present invention also provides a nucleic acid that hybridizes under standard 
hybridization conditions to the nucleotide sequence ACCTGGGCGGGACGCGCCA 
(SEQ ID NO:40). In another embodiment the nucleic acid hybridizes under standard 

20 hybridization conditions to the nucleotide sequence complementary to SEQ ID 
NO:40. In yet another embodiment the nucleic acid hybridizes under standard 
hybridization conditions to the nucleotide sequence ACCTGGGCGGGACGCGCC 
(SEQ ID NO:46). In yet another embodiment the nucleic acid hybridizes under 
standard hybridization conditions to the nucleotide sequence complementary to SEQ 

25 ID NO:46. In a particular embodiment the nucleic acid consists of 9 to 96 

nucleotides. In another embodiment the nucleic acid consists of 12 to 48 nucleotides. 
In still another embodiment the nucleic acid consists of 15 to 36 nucleotides. In a 
preferred embodiment the nucleic acid consists of 17 to 20 nucleotides. 

The present invention also provides a nucleic acid that hybridizes to the nucleotide 
30 sequence of SEQ ID NO:41, but not to the nucleotide sequence of SEQ ID NO:42 
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when the hybridization is performed under identical conditions. In a particular 
embodiment the nucleic acid comprises the nucleotide sequence of 
CCCACGGTCGGGGTACCTGGGCGGGACGCGCCAGGCCGACTCCCGGCGA 
(SEQ ID NO:29). The present invention further provides a nucleic acid that 
hybridizes to the nucleotide sequence of SEQ ID NO:42, but not to the nucleotide 
sequence of SEQ ID NO:41 when the hybridization is performed under identical 
conditions. In a particular embodiment the nucleic acid comprises the nucleotide 
sequence of CCCACGGTCGGGGTGGCCGACTCCCGGCGA (SEQ ID NO:37). 

In a related embodiment the present invention provides an isolated nucleic acid that 
hybridizes to the complementary strand of the nucleotide sequence of SEQ ID NO:42, 
but not to the complementary strand of the nucleotide sequence of SEQ ID NO:41 
when the hybridization is performed under identical conditions. In still another 
embodiment the nucleic acid hybridizes to the nucleotide sequence of SEQ ID 
NO:41, but not to the nucleotide sequence of SEQ ID NO:42 when the hybridization 
is performed under identical conditions. In still another embodiment the nucleic acid 
hybridizes to the complementary strand of the nucleotide sequence of SEQ ID NO:41 , 
but not to the complementary strand of the nucleotide sequence of SEQ ID NO:42 
when the hybridization is performed under identical conditions. 
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the nucleotide sequence of SEQ ID NO:42 when the hybridization is performed under 
identical conditions. In still another embodiment the nucleic acid hybridizes to the 
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complementary strand of the nucleotide sequence of SEQ ID NO:42 when the 
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The present invention also provides for the use of the nucleic acids of the present 
invention (as well as other nucleic acids which can be used to identify DNA 
polymorphisms in the alleles of the genes involved in folate, pyridoxine, and/or 
cobalamin metabolism) in the methods of the present invention for identifying, 
diagnosing, preventing and/or treating individuals. 

In methods of estimating the susceptibility due to genetic or genetic and 
environmental factors for an individual to have or to develop a developmental 
disorder or to have offspring that develop a developmental disorder, and for the 
corresponding methods of generating genetic, or genetic and environmental reference 
datasets, the present invention provides a step of analyzing nucleic acids and/or 
proteins from biological samples. In one particular embodiment, the assaying for the 
presence of the genetic variant of human dihydro folate reductase having a nucleotide 
sequence with a 19 base-pair deletion spanning nucleotides 540 to 558 of the 
nucleotide sequence of SEQ ID NO:41 is included as part of this analysis. This 
genetic variant of human dihydrofolate reductase becomes a genetic explanatory 
variable. 

Determining if the biological sample contains the genetic variant of human 
dihydrofolate reductase having a nucleotide sequence with a 19 base-pair deletion 
spanning nucleotides 540 to 558 of the nucleotide sequence of SEQ ID NO:41 can be 
performed by any appropriate method including PCR, special PCR, RT PCR, RFLP 
analysis, SSCP, and FISH. 

In addition, all of the nucleic acids of the present invention including cDNA or 
genomic DNA can be placed into expression vectors operably associated with an 
expression control sequence. Alternatively, when the nucleic acid is part of an 
expression control sequence, the nucleic acid and/or the expression control sequence 
can be placed into an expression vector to control the expression of a coding 
sequence, such as a reporter gene. Such expression vectors can then be placed into 
either eukaryotic or prokaryotic host cells and expressed. The host cells comprising 
the expression vectors are also part of the present invention. In addition, when the 
nucleic acid includes a coding sequence or a part of a coding sequence, the present 
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invention includes methods of purifying the gene products from the coding sequence 
or part thereof, and the purified gene products themselves. 

Accordingly, it is a principal object of the present invention to provide a method for 
identifying an individual that is genetically inclined to develop a developmental 
5 disorder or disease. 

It is a further object of the present invention to provide a method for identifying an 
individual that is genetically inclined to develop schizophrenia. 

It is a further object of the present invention to provide a method for identifying an 
individual that is genetically inclined to have offspring having a developmental 
1 0 disorder. 

It is a further object of the present invention to provide a method of diagnosing 
schizophrenia. 

■4 

It is a further object of the present invention to provide a method of treating 
developmental disorders such as schizophrenia. 

1 5 It is a further object of the present invention to provide a method for monitoring the 
treatment of the developmental disorder. 

It is a further object of the present invention to provide a method for ameliorating the 
effect of a defect in folate, pyridoxine or cobalamin metabolism on a fetus due to the 
genetic or environmental status of a pregnant woman. 

20 It is a further object of the present invention to provide a method of treating a patient 
who is genetically inclined to develop a developmental disorder such as 
schizophrenia. 
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It is a further object of the present invention to provide a method of overcoming a 
nutritional lack of folate, cobalamin or pyridoxine of a pregnant woman to prevent the 
development of the corresponding fetus developing a developmental disorder. 

Other objects and advantages will become apparent to those skilled in the art from a 
5 review of the ensuing description. 

These and other aspects of the present invention will be better appreciated by 
reference to the following drawings and Detailed Description. 



BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 shows primers for PCR amplification of the dihydro folate reductase (DHFR) 
10 deletion polymorphism region. 

Figure 2 shows the genotypes of the DHFR 19 basepair deletion by non-denaturing 
polyacrylamide gel electrophoresis. Lanes 1 and 2 show genotypes 1,1. Lanes 3 and 4 
show genotypes 1, 2. Lanes 5 and 6 show genotypes 2,2. Lane 7 shows phiX174 RF 
DNA/HaeHI size markers from BRL Life Technologies. 

1 5 Figure 3 shows the sequences of PCR amplification products in the Region of the 
DHFR polymorphism region. * is explained in Text, see Example 2. 

Figure 4A is a nucleotide sequence of the wild type human DHFR, (SEQ ID NO:41) 
from Yang et al, J. Mol Biol 176:169-187 (1984), GeneBank accession no: X00855. 
The start codon is in bold. Figure 4B is the same nucleotide sequence as that of 
20 Figure 4A except the deletion of the 1 9 nucleotides due to the DHFR deletion 
polymorphism, (SEQ ID NO:42). 
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DETAILED DESCRIPTION OF THE INVENTION 



PCT/US00/14354 



The present invention in its broadest embodiment provides a method of diagnosing, 
preventing and/or treating specific physiological/developmental disorders. Such 
physiological/developmental disorders include schizophrenia, spina bifida cystica, 
5 Tourette's syndrome, bipolar illness, autism, conduct disorders, attention deficit 
disorder, obsessive compulsive disorder, chronic multiple tic syndrome and learning 
disorders such as dyslexia. 

A particular aspect of the present invention provides methodology for diagnosing, 
preventing and/or treating a developmental disorder such as schizophrenia. Such 

1 0 methodology is premised on the correlation between abnormalities in folate, 

cobalamin, and/or pyridoxine metabolism in an individual and/or the mother of an 
individual and the occurrence of the developmental disorder, e.g., schizophrenia in 
the individual. Further, the present invention provides a framework {i.e., the gene- 
teratogen model, and the DNA Polymorphism-Diet-Cofactor-Development both of 

1 5 which are described in detail below) which fully explain the rationale for the 

correlation, though the ultimate usefulness of the methods of the present invention are 
independent of any particular model. 

Within this context, the DNA Polymorphism-Diet-Cofactor-Development model 
maintains that a developmental disorder such as schizophrenia results in part from 

20 developmental brain damage sustained in utero due to maternal dietary deficiency of 
folate, pyridoxine or cobalamin potentiated by the aggregate effect of minor defects 
of folate, pyridoxine or cobalamin genes. The maternal damage to the fetus can result 
in part from insufficiency of the folate, pyridoxine and cobalamin themselves and/or 
from resulting effects such as immune deficiency and maternal teratogens, e.g. 

25 hyperhomocysteinemia. Genes from either parent acting in the fetus may modify 
these damaging effects as exemplified in the gene-teratogen model, below. 

As described herein the present invention can be practiced on a case by case basis, or 
alternati vely, it can be used in the screening of the general population, or within any 
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particular subgroup, such as newborns (as is presently performed in the diagnosis and 
treatment of hyperphenylalaninemia). 

Therefore, if appearing herein, the following terms shall have the definitions set out 
below. 

5 As used herein a "gene involved in folate, pyridoxine, or cobalamin metabolism" is a 
gene that encodes a peptide or protein that plays a role in a pathway involved in either 
folate, pyridoxine, or cobalamin metabolism. An incomplete listing of examples of 
such proteins is given in Tables 2-7. 

As used herein the term "individual" includes a fetus, infant, child, adolescent, and 
10 adult. Therefore, as used herein, an individual originates at conception. 

As used herein an individual with a susceptibility for "having offspring that develop a 
developmental disorder" is meant to be indicative of the susceptibility of the 
offspring of that individual to develop the developmental disorder and is not in any 
way meant to be indicative of the susceptibility of the individual to have offspring. 

15 The term "proband" as used herein is operationally defined by Table 8 along with the 
accompanying explanatory information {see, Example 1). For most purposes, the 
proband can be considered the central figure in the familial analysis, the remaining 
individuals in the family being designated as "blood relatives". There are three types 
of probands: (1) an "affected proband" i.e., an individual that is believed to have a 

20 developmental disorder ; (2) a "control proband" an individual that is believed not to 
have a developmental disorder; and (3) a "diagnostic proband" i.e., an individual 
being diagnosed. 

As used herein a "blood relative" of an individual is a relative that is related to the 
individual in a genetic sense. Blood relatives can include mothers, fathers, children, 
25 uncles, aunts, brothers, sisters, and grandparents. Preferably a blood relative is a 
parent, a sibling, or a grandparent. Adopted relatives, step-parents, relatives through 
marriage and the like are not blood relatives. Therefore, as used herein, the terms 
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"mother", "father", "sibling", "grandparent", "grandfather" and "grandmother" are 
indicative of blood relationships. 

As used herein a "mate of an individual" is a person whose genetic material is 
combined with that of the individual for the conception of the offspring in question. 

5 As used herein the term "schizophrenia" describes a disorder that is at least partially 
due to one or more genetic mutations or polymorphisms in one or more genes 
involved in folate, cobalamin or pyridoxine metabolism in an individual that is 
schizophrenic and/or to one or more genetic mutations or polymorphisms in one or 
more genes involved in folate, cobalamin or pyridoxine metabolism in the mother of 

10 that individual. 

As used herein an individual is "schizophrenic" when the individual displays 
symptoms that would be accepted by an experienced psychiatrist to merit a diagnosis 
of schizophrenia. Such a diagnosis is based, at least in part, on the currently evolving 
guidelines for the diagnosis of schizophrenia which are listed in the successive 
1 5 editions of Diagnostic and Statistical Manual for Mental Disorders, put out by the 
American Psychiatric Association. The current edition is the DSM, Fourth Edition 
(1994). 

As used herein the terms "spina bifida cystica", 'Tourette's syndrome", "bipolar 
illness", "autism", "conduct disorder", "attention deficit disorder", "obsessive 

20 compulsive disorder", "chronic multiple tic syndrome" and "learning disorders" such 
as "dyslexia"describe disorders which display symptoms that would be accepted by 
an experienced psychiatrist to merit a diagnosis of that disorder. Such a diagnosis is 
based, at least in part, on the currently evolving guidelines which are listed in the 
successive editions of Diagnostic and Statistical Manual for Mental Disorders, put out 

25 by the American Psychiatric Association. The current edition is the DSM, Fourth 
Edition (1994). 

As used herein the term "teratogenic locus" indicates one or more alleles that act in a 
pregnant woman to cause an intrauterine teratogenic effect on the fetus. 
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As used herein the terms "specificity locus" or "modifying locus" are used 
interchangeably and are indicative of one or more alleles that can act during 
pregnancy and/or after birth to prevent, modify, and/or ameliorate the teratogenic 
effect of the teratogenic locus. 

5 As used herein a "constellation of genetic mutations" is the set of genetic risk factor 
mutations that is present in a proband and relatives of the proband. One example of a 
constellation of genetic mutations is shown in a line of Table 8, below. 

As used herein a "risk factor" is a teratogen or substance (including a defective gene) 
that can lead to a teratogenic effect that is present or suspected of being present in a 
1 0 tissue sample or body fluid of an individual's mother during the individual's gestation 
and/or present or suspected of being present in a tissue sample or body fluid of the 
individual. 

As used herein a "genetic risk factor" is used interchangeably with the term "genetic 
explanatory variable" and is a genetic mutation and/or polymorphism that causes or 
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The current (developmental) model for schizophrenia is that genetic and 
environmental factors cause brain damage in a fetus that later develops schizophrenia. 
However, the genetic and environmental factors have not been identified. Also, 
extensive linkage and association studies have failed to identify genes determining 
5 schizophrenia. The reasons usually given for this difficulty include: (i) locus 
heterogeneity, i.e., more than one gene locus is involved, perhaps many gene loci 
each with a small effect; (ii) the mode of inheritance of schizophrenia is unknown; 
and (iii) an additional possible factor is that the frequency of the disease alleles may 
be high, thus greatly reducing the power of linkage studies. 

10 The DNA Polymorphism-Diet-Cofactor-Development model explains all of these 
difficulties and at the same time proposes a unified metabolic abnormality. The 
unified metabolic abnormality is: (a) ENVIRONMENTAL, i.e., due to a 
folate/cobalamin/pyridoxine deficiency caused by either decreased ingestion or 
increased requirement during pregnancy; (b) GENETIC, i.e., due to a 

15 folate/cobalamin/pyridoxine genetic defect caused by the aggregate effect of multiple 
mutations of folate/cobalamin/pyridoxine genes each individually having a small 
effect; and (c) the interaction of the folate/cobalamin/pyridoxine environmental and 
genetic factors (indicated above) to cause other harmful effects such as maternal 
teratogens and immune deficiency during gestational development. Different gene 

20 loci and different combinations of gene loci will be involved in different patients and 
different families. The problem of locus heterogeneity is addressed by the hypothesis 
that the folate/cobalamin/pyridoxine genetic defect is the aggregate effect of multiple 
mutations of folate/cobalamin/pyridoxine genes each of which have a relatively small 
effect. 

25 The problem of mode of inheritance is addressed by the gene-teratogen model. The 
gene-teratogen model describes the special features of genes acting in utero; both 
teratogenic and modifying of specificity loci may be involved. If these effects are not 
taken into account, the assignment of affection status in schizophrenia pedigrees is 
inaccurate. Assignment of affection status is a key element in defining the mode of 

30 inheritance for all kinds of linkage mapping. Failure to assign the correct mode of 
inheritance is another factor that has made the linkage studies very difficult. 
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Finally, the DNA Polymorphism-Diet-Cofactor-Development model proposes that 
some of the genetic factors for schizophrenia are common in the population. In fact, 
subclinical deficiency of folate, pyridoxine, and cobalamin is common in the 
population and common among pregnant women as well. Pregnancy further 
increases the requirement for folate, pyridoxine, and cobalamin. Common genetic 
polymorphisms of folate and cobalamin genes are also known, some of them 
functional. Common genetic risk factors tend to be functional polymorphisms and/or 
mutant alleles that individually have small effects. Otherwise, they would be largely 
eliminated from the population by natural selection and would not be common. High 
disease allele frequency is yet another factor that greatly diminishes the power of a 
linkage study. 

Besides explaining the difficulties with current linkage studies, the DNA 
Polymorphism-Diet-Cofactor-Development model explains all of the unusual 
biological and epidemiological features of schizophrenia: e.g. the decreased amount 
of gray matter in brain areas, the unusual birth-month effect, the geographical 
differences in incidence, the socioeconomic predilection, the association with 
obstetrical abnormalities (low birth weight and prematurity), and the association with 
famine and viral epidemics. Consistently, genetic linkage and cytogenetic studies in 
schizophrenia have implicated various chromosome regions, some of them containing 
folate, pyridoxine, and cobalamin genes including dihydrofolate reductase, 
thymidylate synthase, and transcobalamin II. The DNA 

Polymorphism-Diet-Cofactor-Development model predicts that folate, pyridoxine, or 
cobalamin gene mutations have a high frequency in schizophrenia patients or family 
members. Furthermore, mothers of schizophrenics are predicted to be particularly 
susceptible to producing one or more teratogens during pregnancy. 

The present invention therefore provides methods for: (a) Diagnostic testing of 
schizophrenia by identifying a folate, pyridoxine, or cobalamin gene mutation or 
constellation of mutations in the patient, mother, and father, (b) Prevention of 
schizophrenia by diagnostic testing in families already affected by schizophrenia or 
by diagnostic population screening for folate mutations and identifying couples at risk 
for producing schizophrenic offspring. These pregnancies can be further monitored 
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for risk factors, e.g. dietary folate/pyridoxine/cobalamin, plasma 
folate/pyridoxine/cobalamin, or red blood cell folate; plasma homocysteine or other 
teratogens, (c) Therapy for schizophrenia, e.g., treating the pregnant mother with 
folate, pyridoxine, cobalamin or other agents. The treatment can be monitored at 
5 regular intervals to determine the effect of therapy, (d) Presymptomatic treatment of 
schizophrenia on young children found to be susceptible to schizophrenia by 
diagnostic testing for folate gene mutations and other risk factors can also be treated 
with methylfolate or related therapeutic modalities to forestall the appearance of 
schizophrenia symptoms in adolescence or adulthood. 

10 Empirical studies with methylfolate treatment of schizophrenia have shown modest 
clinical improvement. The DNA Polymorphism-Diet-Cofactor-Development model 
gives a rationale for such therapy as well as for intensive testing of related therapeutic 
modalities. Genetic testing will need to be carried out in such patients to gauge their 
likelihood of responding to therapy. In addition, the DNA 

1 5 Polymorphism-Diet-Cofactor-Development model gives direction and impetus 
toward uncovering the mechanism of fetal brain damage leading to schizophrenia. 

Diagnostic testing for schizophrenia can involve testing not just the patient, but 
mother and father as well, for not just one factor but multiple genetic factors. For 
example, data for two gene loci (both folate-related genes) were used in Example 2. 
20 In this case, there were only four explanatory variables for each comparison. 

In addition, risk factors appearing only during pregnancy may play a role, e.g. dietary 
folate which can be further monitored during the pregnancy. In certain instances, 
genotype data can be used as the sole explanatory variables, particularly in the case 
when no environmental explanatory variables are known. In such a case, the 
25 predicted probabilities will be only for the genetic component of the proband's risk of 
schizophrenia. In addition, schizophrenia mothers, fathers, and sibs do not 
necessarily have to come from the same families as the schizophrenia probands, as 
described in Example 2. 
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Of course certain genetic factors will turn out to be more common than others. This 
may simplify testing somewhat. Also some genetic factors may operate chiefly in the 
mother, while others will operate chiefly in the schizophrenic patient. This may also 
simplify testing. There are some approaches to assessing risk factors during a past 
pregnancy, e.g. current dietary history as an indicator of past diet, methionine loading 
as in indicator pf how susceptible a mother is to raising her plasma homocysteine, 
assessment of other risk factors besides folate metabolism that may affect pregnancy 
outcome. Procedures including all of these variables are both envisioned and 
included in the present invention. 

Thus the present invention provides a method of diagnosis of schizophrenia. In one 
aspect of the invention, diagnostic testing for genetic susceptibility to schizophrenia 
determines the probability that the proband is affected with schizophrenia due to 
genetic factors. This is carried out by genetic testing of a patient suspected of having 
schizophrenia and/or whatever informative relatives are available, e.g. mother, father, 
sibs, or children. The genotypes of certain folate and/or cobalamin and/or pyridoxine 
gene mutations or constellation of mutations (folate and/or cobalamin and/or 
pyridoxine gene mutations) are determined for each individual. 

Since the abnormal phenotype of schizophrenia can be determined by both genetic 
and environmental factors and since other genetic factors besides 
folate/cobalamin/pyridoxine gene mutations may be involved, the presence of 
folate/cobalamin/pyridoxine gene mutations may be neither necessary nor sufficient 
to cause schizophrenia. Thus, an unaffected individual may have the same genetic 
risk factors as an affected individual but may lack sufficient environmental factors to 
cause the abnormal clinical disease. Also, an affected individual may lack 
folate/cobalamin/pyridoxine gene mutations but may have other related or non-related 
genetic risk factors that caused the schizophrenia. 

Therefore folate/cobalamin/pyridoxine gene mutations are used as explanatory 
variables (genetic risk factors) to calculate the predicted probability that an individual 
has genetic susceptibility to schizophrenia due to these mutations. Genetic variation 
can be expected to account for approximately about half of the risk of developing 



WO 00/71754 PCT/US00/14354 

25 

Of course certain genetic factors will turn out to be more common than others. This 
may simplify testing somewhat. Also some genetic factors may operate chiefly in the 
mother, while others will operate chiefly in the schizophrenic patient. This may also 
simplify testing. There are some approaches to assessing risk factors during a past 
5 pregnancy, e.g. current dietary history as an indicator of past diet, methionine loading 
as in indicator of how susceptible a mother is to raising her plasma homocysteine, 
assessment of other risk factors besides folate metabolism that may affect pregnancy 
outcome. Procedures including all of these variables are both envisioned and 
included in the present invention. 

10 Thus the present invention provides a method of diagnosis of schizophrenia. In one 
aspect of the invention, diagnostic testing for genetic susceptibility to schizophrenia 
determines the probability that the proband is affected with schizophrenia due to 
genetic factors. This is carried out by genetic testing of a patient suspected of having 
schizophrenia and/or whatever informative relatives are available, e.g. mother, father, 

15 sibs, or children. The genotypes of certain folate and/or cobalamin and/or pyridoxine 
gene mutations or constellation of mutations (folate and/or cobalamin and/or 
pyridoxine gene mutations) are determined for each individual. 

Since the abnormal phenotype of schizophrenia can be determined by both genetic 
and environmental factors and since other genetic factors besides 

20 folate/cobalamin/pyridoxine gene mutations may be involved, the presence of 

folate/cobalamin/pyridoxine gene mutations may be neither necessary nor sufficient 
to cause schizophrenia. Thus, an unaffected individual may have the same genetic 
risk factors as an affected individual but may lack sufficient environmental factors to 
cause the abnormal clinical disease. Also, an affected individual may lack 

25 folate/cobalamin/pyridoxine gene mutations but may have other related or non-related 
genetic risk factors that caused the schizophrenia. 

Therefore folate/cobalamin/pyridoxine gene mutations are used as explanatory 
variables (genetic risk factors) to calculate the predicted probability that an individual 
has genetic susceptibility to schizophrenia due to these mutations. Genetic variation 
30 can be expected to account for approximately about half of the risk of developing 



WO 00/71754 PCT/US00/14354 

26 

schizophrenia since the concordance rate in identical twins has been estimated to be 
about 50%. The other half of the risk results from environmental factors due to their 
different positions in the uterus and to differences in the blood supply. The use of 
environmental factors as additional explanatory variables enhances this probability 
calculation, although this environmental data is more difficult to gather. Together, 
using both genetic and environmental explanatory variables, the predicted probability 
that an individual is schizophrenic may approach 1 .0. 

One likely situation for the use of the present methodology is in the diagnosis of a 
patient that has developed a psychosis. In such a case, the clinician is likely to be 
interested in determining the probability that this individual has schizophrenia. The 
number of blood relatives (preferably first degree relatives) of the patient-to-be 
diagnosed, both unaffected and affected, could then be determined. The number of 
these who would contribute a blood sample for analysis, for example, could then be 
ascertained. It is preferable that the patient-to-be-diagnosed also contributes a blood 
sample, however in certain situations, this may not be an option. The availability of 
dietary and epidemiological information for environmental explanatory variables, 
especially from the patient and the mother, can also ascertained. Of course all 
relevant legal and ethical rules should be followed regarding informed consent for the 
genetic testing. 

Biological samples such as tissue or fluid samples (e.g., 7 ml of blood in an 
EDTA-containing vacutainer, see Example 2, below), and obtainable environmental 
data from the patient and family members are then collected. DNA is extracted from 
the sample and genotypes for alleles of folate and/or cobalamin and/or pyridoxine 
genes are determined. The methods for genotyping depend upon the specific genetic 
markers used as explanatory variables. The methods for allele determination for two 
genetic markers are discussed in the Examples below. 

Data of the genetic and environmental explanatory variables for the 
patient-to-be-diagnosed (proband) and participating family members are added to a 
reference data set preferably consisting of well-defined schizophrenia probands and 
family members, and control probands, and family members for whom data is 
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available for many explanatory variables. As an approximation the control probands 
themselves also can be used as the controls for each proband family member class as 
shown in Example 2, below. Thus, as an approximation the control probands can be 
used as controls for the affected probands; and/or separately for the mothers of 
5 affected probands; and/or separately for the fathers of affected probands, etc. Another 
example of a use of the control probands is in the evaluation and/or analysis of a 
particular diagnostic proband. In this case, the approximation is obtained by adding 
the diagnostic proband to the group of affected probands and control probands. 

A model is then created consisting of the explanatory variables actually available 
10 from specific patient-to-be diagnosed and family members participating in the testing. 
This new combined data set (reference data set and data from patient-to-be-diagnosed 
with participating family members) is analyzed by binary logistic regression (e.g., 
using a statistical software package such as the SAS System embodied in Example 1 
below, though other programs may be used) for the model chosen giving the 
1 5 predicted probability that a proband is affected with schizophrenia for all of the 
probands including the patient-to-be-diagnosed. 

In a particular embodiment the model is modified and the goodness of fit for the 
patient-to-be-diagnosed is checked. The predicted probability that the 
patient-to-be-diagnosed has schizophrenia is compared with a classification table 
20 generated from the model used to determine the likelihood of false positives and false 
negatives. 

The predicted probability that the patient-to-be-diagnosed is affected with 
schizophrenia, with the likelihood of false positive or false negative result, can then 
be forwarded to the clinician. 

25 The methods for determining an individual's risk for developing schizophrenia taught 
by the present invention can be used in a variety of settings. For example, the present 
invention also provides a therapy for schizophrenia. Empirical studies with 
methylfolate treatment of schizophrenia have shown modest clinical improvement. 
The DNA Polymorphism-Diet-Cofactor-Development model provides a rationale for 
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such therapy as well as for intensive testing of related therapeutic modalities, e.g. 
other cofactors such as cobalamin or pyridoxine. In addition, the DNA 
Polymorphism-Diet-Cofactor-Development model gives direction and impetus 
toward uncovering the mechanism of fetal brain damage leading to schizophrenia. Of 
course such therapy also can be provided on a case by case basis in order to gauge the 
likelihood of the patient of responding to such therapy, with the methodology for 
diagnosis of the present invention enabling the skilled practitioner to assess that 
likelihood. 

In addition, the present invention provides a method of identifying individuals that 
are likely to be aided by presymptomatic treatment for schizophrenia. For example, 
young children found to have a high risk for susceptibility to schizophrenia by 
diagnostic testing can be treated with methylfolate or related therapeutic modalities to 
forestall the appearance of schizophrenia symptoms in adolescence or adulthood. 
The present invention further provides methodology for diagnostic testing for specific 
families already affected by schizophrenia. 

The present invention further provides methodology for population screening for 
folate/cobalamin/pyridoxine mutations to help identify couples at risk for producing 
schizophrenic offspring. Subsequent or concurrent pregnancies can then be 
monitored for environmental risk factors, and treated with folate, cobalamin, 
pyridoxine or other agents and monitored at intervals for the effect of therapy. Such 
monitoring can include measuring levels of folate, cobalamin, pyridoxine or 
homocysteine in a particular tissue and/or fluid sample, such as blood. 

Since schizophrenia is a developmental disorder, it is likely that these same risk 
factors discussed here for schizophrenia could play a role in other developmental 
disorders including spina bifida cystica, Tourette's syndrome, learning disorders 
including dyslexia, conduct disorder, attention-deficit hyperactivity disorder, bipolar 
illness, autism, and obsessive-compulsive disorder. Interestingly, the mode of 
inheritance of these disorders, like that of schizophrenia, has been difficult to 
determine despite the fact that a genetic component to the etiology of each has been 
documented. Therefore, methodology analogous to that exemplified herein for 
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forestall the appearance of schizophrenia symptoms in adolescence or adulthood. 
The present invention further provides methodology for diagnostic testing for specific 
families already affected by schizophrenia. 

The present invention further provides methodology for population screening for 
folate/cobalamin/pyridoxine mutations to help identify couples at risk for producing 
schizophrenic offspring. Subsequent or concurrent pregnancies can then be 
monitored for environmental risk factors, and treated with folate, cobalamin, 
pyridoxine or other agents and monitored at intervals for the effect of therapy. Such 
monitoring can include measuring levels of folate, cobalamin, pyridoxine or 
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documented. Therefore, methodology analogous to that exemplified herein for 
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schizophrenia can be readily adapted for diagnosing and/or treating other such 
developmental disorders. 

Nucleic Acids 

In accordance with the present invention there may be employed conventional 
5 molecular biology, microbiology, and recombinant DN A techniques within the skill 
of the art. Such techniques are explained fully in the literature. See, e.g., Sambrook, 
Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition 
(1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (herein 
"Sambrook et ai, 1989"); DNA Cloning: A Practical Approach, Volumes I and II 
10 (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed. 1984); Nucleic Acid 
Hybridization [B.D. Hames & S.J. Higgins eds. (1985)]; Transcription And 
Translation [B.D. Hames & S.J. Higgins, eds. (1984)]; Animal Celt Culture [R.I. 
Freshney, ed. (1986)]; Immobilized Cells And Enzymes [IRL Press, (1986)]; 
B. Perbal, A Practical Guide To Molecular Cloning (1984); F.M. Ausubel et al 
1 5 (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1 994)]. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of 
ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or 
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or 
deoxycytidine; "DNA molecules"), or any phosphoester analogs thereof, such as 

20 phosphorothioates and thioesters, in either single stranded form, or a double-stranded 
helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. 
The term nucleic acid molecule, and in particular DNA or RNA molecule, refers only 
to the primary and secondary structure of the molecule, and does not limit it to any 
particular tertiary forms. Thus, this term includes double-stranded DNA found, inter 

25 alia, in linear or circular DNA molecules including restriction fragments, plasmids, 
and chromosomes. In discussing the structure of particular double-stranded DNA 
molecules, sequences may be described herein according to the normal convention of 
giving only the sequence in the 5' to 3' direction along the nontranscribed strand of 
DNA (i.e., the strand having a sequence homologous to the mRNA). A "recombinant 
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DNA molecule" is a DNA molecule that has undergone a molecular biological 
manipulation. 

A nucleic acid molecule is f, hybridizable M to another nucleic acid molecule, such as a 
cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid 
5 molecule can anneal to the other nucleic acid molecule under the appropriate 

conditions of temperature and solution ionic strength (see Sambrook et al, supra). 
The conditions of temperature and ionic strength determine the "stringency" of the 
hybridization. High stringency hybridization conditions correspond to 50% 
formamide, 5x or 6x SSC. Hybridization requires that the two nucleic acids contain 
10 complementary sequences, although depending on the stringency of the hybridization, 
mismatches between bases are possible. The appropriate stringency for hybridizing 
nucleic acids depends on the length of the nucleic acids, the GC percentage, and the 
degree of complementation, variables well known in the art. The greater the degree 
of similarity or homology between two nucleotide sequences, the greater the value of 
1 5 T m for hybrids of nucleic acids having those sequences. The relative stability 

(corresponding to higher T m ) of nucleic acid hybridizations decreases in the following 
order: RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of greater than 100 
nucleotides in length, equations for calculating T m have been derived {see Sambrook 
et ai t supra, 9.50-10.51). For hybridization with shorter nucleic acids, i.e., 
20 oligonucleotides, the position of mismatches becomes more important, and the length 
of the oligonucleotide determines its specificity (see Sambrook et aL, supra, 1 1.7- 
11.8). Preferably a minimum length for a hybridizable nucleic acid (e.g., a nucleotide 
probe or primer such as a PCR or RT-PCR primer) is at least about 12 nucleotides; 
preferably at least about 18 nucleotides; and more preferably the length is at least 
25 about 27 nucleotides; and most preferably at least about 36 nucleotides. Specific 
probes and primers that can be used to distinguish specific variants of the nucleic 
acids encoding the proteins involved in folate, pyridoxine, and/or cobalamin 
metabolism are also part of the present invention. 

Such nucleotide probes and primers can be labeled or used to label complementary 
30 DNA (where appropriate) by any number of ways well known in the art including 
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using a radioactive label, such as 3 H, l4 C, 32 P, or 35 S, a fluorescent label, a boron label 
[U.S. Patent No: 5,595,878, Issued January 21, 1997 and U.S. Patent No: 5,876,938, 
Issued March 2, 1999 which are incorporated by reference in their entireties], and 
enzymatic tags such as urease, alkaline phosphatase or peroxidase. In the case of 
5 enzyme tags, colorimetric indicator substrates are known which can be employed to 
provide a means visible to the human eye or spectrophotometrically, to identify 
specific hybridization with complementary nucleic acid-containing samples. 

In a specific embodiment, the term "standard hybridization conditions" refers to a T m 
of 55 °C, and utilizes conditions as set forth above e.g., 5X SSC. In a preferred 
10 embodiment, the T m is 60 °C; in a more preferred embodiment, the T m is 65 °C. 

A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed 
and translated into a polypeptide in a cell in vitro or in vivo when placed under the 
control of appropriate regulatory sequences. The boundaries of the coding sequence 
are determined by a start codon at the 5' (amino) terminus and a translation stop 

15 codon at the 3' (carboxyl) terminus. A coding sequence can include, but is not 
limited to, prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA 
sequences from eukaryotic (e.g., mammalian) DNA, and even synthetic DNA 
sequences. If the coding sequence is intended for expression in a eukaryotic cell, a 
polyadenylation signal and transcription termination sequence will usually be located 

20 3' to the coding sequence. 

"Transcriptional and translational control sequences" are DNA regulatory sequences, 
such as promoters, enhancers, terminators, and the like, that provide for the 
expression of a coding sequence in a host cell. In eukaryotic cells, polyadenylation 
signals are control sequences. 



25 



A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase and initiating transcription of a downstream (3' direction) coding 
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25 A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase and initiating transcription of a downstream (3' direction) coding 
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sequence. For purposes of defining the present invention, the promoter sequence is 
bounded at its 3' terminus by the transcription initiation site and extends upstream (5' 
direction) to include the minimum number of bases or elements necessary to initiate 
transcription at levels detectable above background. Within the promoter sequence 
5 will be found a transcription initiation site (conveniently defined for example, by 
mapping with nuclease SI), as well as protein binding domains (consensus 
sequences) responsible for the binding of RN A polymerase. 

A "signal sequence' 1 is included at the beginning of the coding sequence of a protein 
to direct the protein to a particular site/compartment in the cell such as the surface of 
10 a cell. This sequence encodes a signal peptide, N-terminal to the mature polypeptide, 
that directs the host cell to translocate the polypeptide. The term "translocation signal 
sequence" is used herein to refer to this sort of signal sequence. Translocation signal 
sequences can be found associated with a variety of proteins native to eukaryotes and 
prokaryotes, and are often functional in both types of organisms. 

15 Identification of Genetic Mutations 

A biological sample can be obtained from an individual and/or a blood relative of the 
individual, and from appropriate controls, using a sample from any body component 
including tissue punches, body fluids, and hair, as long as the biological sample 
contains nucleic acids and/or proteins/peptides. Thus the DNA, mRNA, proteins or 
20 peptides of the biological sample can be used to identify mutations and/or variants in 
genes involved in folate, pyridoxine, or cobalamine metabolism. The present 
invention therefore includes methods of detecting and quantifying these nucleic acids 
and/or proteins/peptides that can be used to identify genetic risk factors. 

In a particular embodiment the DNA is extractable. A particularly useful source of 
25 DNA is blood. For example, 2.5- 40 mis of blood can be collected in a vacutainer 
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containing EDTA. The blood sample is placed on ice and then centrifuged to separate 
plasma, red cells, and buffy coat. The separated fractions are then frozen at -80 °C. 

The DNA can be isolated from the buffy coat by a number of procedures well known 
in the art including using a QIAmp column DNA extraction procedure or the 
5 QIAGEN Genomic-tip method. The isolated DNA can be digested with a series of 
restriction enzymes, for example, and then the digested products can be hybridized 
with one or more particular nucleic acid probes designed from a particular gene to 
identify the gene and preferably to test for particular genetic mutations. 

Preferably the genomic DNA can be amplified by PGR using appropriate primer pairs 
10 such as the primer pairs for the MTHFR or DHFR genes which were used in the 
Example below. The PGR amplified product can be sequenced directly, or 
alternatively be digested with one or more appropriate restriction enzymes. The 
resulting digested products can be separated e.g., by column chromatography, or 
preferably by polyacrylamide or agarose gel electrophoresis. The isolated digestion 
1 5 products can be compared e.g. , by previously determined restriction maps, and/or 
alternatively, the digestion products can be sequenced directly. Alternatively, as in 
the case of DHFR, genetic polymorphisms can be detected through the use of 
restriction enzymes. 

Although a restriction map of a gene is sufficient for the employment of the methods 
20 disclosed herein, in preferred embodiments the nucleotide sequences of the genes 
used in the testing steps are known. To this end a large sampling of such sequences 
are provided in Tables 2-7. (These sequences may also be used in the design of 
restriction maps.) Thus, initially each gene whether used separately or used in a 
constellation of genes is characterized by the sequencing of the wild type gene, 
25 preferably including the coding regions, introns, control sequences, and other non- 
coding regions. In addition, mutations of such genes found in the general population 
can also be characterized. With the recent advances in the sequencing of the human 
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genome the present invention contemplates that additional sequence information will 
become publicly available, particularly with regard to mutations in relevant introns, 
and control sequences etc. which are not available in cDNA libraries. Such sequence 
information is fully envisioned to be incorporated into the on-going compilations of 
5 relevant DNA sequence databases of the present invention, as well as for its parallel 
use in the general methodology described herein. Thus DNA or mRNA or cDNA 
made from the mRNA can be used to identify mutations and/or variants in genes 
involved in folate, pyridoxine, or cobalamine metabolism. 

There are many methods currently known in the art to identify variant/mutant DNA, 
10 all of which may be used in the present invention (see e.g., internet address 

http://www.ich.bpmf.ac.uk/cmgs/mutdet.htm). Such methods include but in no way are 
limited to direct sequencing, array sequencing, matrix-assisted laser 
desorption/ionization time-of-flight mass spectrometry (Malditof) [Fitzgerald et al. y 
Ann. Rev. Biophy. Biomol Struct. 24:1 17-140 (1995)], Polymerase Chain Reaction 
15 "PCR", reverse-transcriptase Polymerase Chain Reaction "RT-PCR", RNAase 

protection assays, Array quantitation e.g., as commercially provided by Affymetrix, 
Ligase Chain Reaction or Ligase Amplification Reaction (LCR or LAR), 
Self-Sustained Synthetic Reaction (3SR/NASBA), Restriction Fragment Length 
Polymorphism (RFLP),Cycling Probe Reaction (CPR), Single-Strand Conformation 
20 Polymorphism (SSCP), heteroduplex analysis, hybridization mismatch using 

nucleases (e.g., cleavase), Southern, Northerns, Westerns, South Westerns, ASOs, 
Molecular beacons, footprinting, and Fluorescent In Situ Hybridization (FISH). Some 
of these methods are briefly described below. 

PCR is a method for increasing the concentration of a segment of target sequence in a 
25 mixture of genomic DNA without cloning or purification. PCR can be used to 
directly increase the concentration of the target to an easily detectable level. This 
process for amplifying the target sequence involves introducing a molar excess of two 
oligonucleotide primers which are complementary to their respective strands of the 
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double-stranded target sequence to the DNA mixture containing the desired target 
sequence. The mixture is denatured and then allowed to hybridize. Following 
hybridization, the primers are extended with polymerase so as to form complementary 
strands. The steps of denaturation, hybridization, and polymerase extension can be 

5 repeated in order to obtain relatively high concentrations of a segment of the desired 
target sequence. The length of the segment of the desired target sequence is 
determined by the relative positions of the primers with respect to each other, and, 
therefore, this length is a controllable parameter. Because the desired segments of the 
target sequence become the dominant sequences (in terms of concentration) in the 

10 mixture, they are said to be "PCR-amplified." [Mullis (U.S. Patent No. 4,683,195) 
and Mullis et al. (U.S. Patent No. 4,683,202)] 

In Ligase Chain Reaction or Ligase Amplification Reaction (LCR or LAR) four 
oligonucleotides, two adjacent oligonucleotides which uniquely hybridize to one 
strand of target DNA, and a complementary set of adjacent oligonucleotides, which 

1 5 hybridize to the opposite strand are mixed and DNA ligase is added to the mixture. 
Provided that there is complete complementarity at the junction, ligase will covalently 
link each set of hybridized molecules. Importantly, in LCR, two probes are ligated 
together only when they base-pair with sequences in the target sample, without gaps 
or mismatches. Repeated cycles of denaturation, hybridization and ligation amplify a 

20 short segment of DNA. [Barany, Proc. Natl. Acad. Sci., 88:189 (1991); Barany, PCR 
Methods and Applic, 1:5 (1991); and Wu and Wallace, Genomics 4:560 (1989)] 
LCR has also been used in combination with PCR to achieve enhanced detection of 
single-base changes. Segev, PCT Public. No. W09001069 Al (1990). 

Self-Sustained Synthetic Reaction (3SR/NASBA) is a transcription-based in vitro 
25 amplification system [Guatelli et al 7 Proc. Natl Acad. ScL, 87:1874-1878, 7797 
(1990); Kwok et al y Proc. Natl Acad. Sci., 86:1173-1 177) that can exponentially 
amplify RNA sequences at a uniform temperature. The amplified RNA can then be 
utilized for mutation detection (Fahy et al y PCR Meth. AppL, 1 :25-33 (1991). In this 
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method, an oligonucleotide primer is used to add a phage RNA polymerase promoter 
to the 5' end of the sequence of interest. In a cocktail of enzymes and substrates that 
includes a second primer, reverse transcriptase, RNase H, RNA polymerase and 
ribo-and deoxyribonucleoside triphosphates, the target sequence undergoes repeated 
5 rounds of transcription, cDNA synthesis and second-strand synthesis to amplify the 
area of interest. 

RFLP can be used to detect DNA polymorphisms arising from DNA sequence 
variation. This method consists of digesting DNA with one or more restriction 
endonucleases (e.g., EcoRI) and analyzing the resulting fragments by means of 

1 0 Southern blots [Southern, E., Methods in Enzymology, 69: 1 52 (1 980)], as further 
described by Botstein, et al., Am. 1 Hum. Genet., 32:314-331 (1980) and White, et 
al., Sci. Am., 258:40-48 (1988). Since a DNA polymorphism may create or delete a 
restriction site, the length of the corresponding restriction fragment with any given 
restriction enzyme could change. Once a difference in a restriction fragment length is 

1 5 identified it can be used to readily distinguish a particular polymorphism from the 
wild type DNA. Mutations that affect the recognition sequence of the endonuclease 
will preclude enzymatic cleavage at that site, thereby altering the cleavage pattern of 
that DNA. DNAs are compared by looking for differences in restriction fragment 
lengths. A technique for detecting specific mutations in any segment of DNA is 

20 described in Wallace, et al.,[Nucl. Acids Res., 9:879-894 (1981)]. It involves 

hybridizing the DNA to be analyzed (target DNA) with a complementary, labeled 
oligonucleotide probe. Due to the thermal instability of DNA duplexes containing 
even a single base pair mismatch, differential melting temperature can be used to 
distinguish target DNAs that are perfectly complementary to the probe from target 

25 DNAs that differ by as little as a single nucleotide. In a related technique, described in 
Landegren, et al., Science, 41:1077-1080 (1988), oligonucleotide probes are 
constructed in pairs such that their junction corresponds to the site on the DNA being 
analyzed for mutation. These oligonucleotides are then hybridized to the DNA being 
analyzed. Base pair mismatch between either oligonucleotide and the target DNA at 
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the junction location prevents the efficient joining of the two oligonucleotide probes 
by DNA ligase. 

When a sufficient amount of a nucleic acid to be detected is available, there are 
advantages to detecting that sequence directly, instead of making more copies of that 

5 target, (e.g., as in PCR and LCR). Most notably, a method that does not amplify the 
signal exponentially is more amenable to quantitative analysis. Even if the signal is 
enhanced by attaching multiple dyes to a single oligonucleotide, the correlation 
between the final signal intensity and amount of target is direct. Such a system has an 
additional advantage that the products of the reaction will not themselves promote 

10 further reaction, so contamination of lab surfaces by the products is not as much of a 
concern. Traditional methods of direct detection including Northern and Southern 
blotting and RNase protection assays usually require the use of radioactivity and are 
not amenable to automation. Recently devised techniques have sought to eliminate 
the use of radioactivity and/or improve the sensitivity in automatable formats. 

15 One such example is the Cycling Probe Reaction (CPR) [Duck et aL, BioTech., 9:142 
(1990)]. CPR , uses a long chimeric oligonucleotide in which a central portion is 
made of RNA while the two termini are made of DNA. Hybridization of the probe to 
a target DNA and exposure to a thermostable RNase H causes the RNA portion to be 
digested. This destabilizes the remaining DNA portions of the duplex, releasing the 

20 remainder of the probe from the target DNA and allowing another probe molecule to 
repeat the process. The signal, in the form of cleaved probe molecules, accumulates 
at a linear rate. While the repeating process increases the signal, the RNA portion of 
the oligonucleotide is vulnerable to RNases that may carried through sample 
preparation. 

25 Single-Strand Conformation Polymorphism (SSCP) is based on the observation that 
single strands of nucleic acid can take on characteristic conformations in 
non-denaturing conditions, and these conformations influence electrophoretic 
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mobility. [Hayashi, PCR Meth, AppL, 1:34-38, (1991). The complementary strands 
assume sufficiently different structures that one strand may be resolved from the 
other. Changes in sequences within the fragment will also change the conformation, 
consequently altering the mobility and allowing this to be used as an assay for 

5 sequence variations (Orita, et al, Genomics S:874r879, (1989). The SSCP process 
involves denaturing a DNA segment (e.g., a PCR product) that is labeled on both 
strands, followed by slow electrophoretic separation on a non-denaturing 
polyacrylamide gel, so that intra-molecular interactions can form and not be disturbed 
during the run. This technique is extremely sensitive tovariations in gel composition 

1 0 and temperature. 

In Fluorescent In Situ Hybridization (FISH), specific probes are designed which can 
readily distinguish the wild-type gene from the variant/mutant gene. Such 
methodology allows the identification of a variant/mutant gene through in situ 
hybridization (U.S. Patent No. 5,028,525, Issued July 2, 1991 ; U.S. Patent No. 
15 5,225,326, Issued July 6, 1993; and U.S. Patent No. 5,501,952, Issued March 26, 
1996. FISH does not require the extraction of DNA. In addition, procedures for 
separating fetal blood cells from maternal blood cells are well known in the art 
allowing the fetus and the mother to be analyzed from the same body fluid sample 
(see U.S. Patent No: 5,629,147, Issued May 13, 1997). 

20 Similarly, antibodies raised against specific mutations and/or variants in the gene 
products of the genes involved in folate, pyridoxine, or cobalamine metabolism can 
be used to identify specific polymorphisms. Alternatively, antibodies raised against 
the wild type proteins can be used to detect and/or quantify the amount of wild type 
protein present in a given biological sample. In the case in which cross-reacting 

25 protein isn't synthesized by the cells of an individual, or is synthesized in 

significantly lower amounts than those of control subjects, such determinations can be 
used to identify a genetic risk factor. In addition, these antibodies can be used in 
methods well known in the art relating to the localization and activity of the gene 
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products, e.g., for Western blotting, imaging the proteins in situ, measuring levels 
thereof in appropriate physiological samples, etc. using any of the detection 
techniques known in the art. Furthermore, such antibodies can be used in flow 
cytometry studies, in immunohistochemical staining, and in immunoprecipitation 
5 which serves to aid the determination of the level of expression of a protein in the cell 
or tissue. 

In the particular instance when the gene product is an enzyme, e.g., dihydrofolate 
reductase, the enzymatic activity of a biological sample can be indicative of the 
presence of a genetic risk factor. In a particular embodiment, a decrease in an enzyme 
1 0 activity that is associated with folate, pyridoxine, or cobalamine metabolism can be 
indicative of the presence of the genetic risk factor. Such assays can be performed on 
multiple samples such as on a microplate reader [Widemann et ai y Clin Chem. 
45:223-228(1999)]. 

MODEL 1 

15 THE GENE-TERATOGEN MODEL FOR THE INHERITANCE PATTERN 
OF CERTAIN DEVELOPMENTAL DISORDERS 
Introduction : 

It has long been known, e.g. from extensive studies of exogenous teratogens in inbred 
mice [Finnell and Chernoff, Gene-teratagen interactions: an approach to 

20 understanding the metabolic basis of birth defects, In Pharmacokinetics in 

Teratogenesis,Vol. 11:97-109 Experimental Aspects In Vivo and In Vitro, CRC Press, 
Inc, Boca Ratan, Fl. (1987)], that teratogens may be influenced by genetic factors. It 
is less well known that the same gene defect may cause different clinical disorders 
depending upon whether the metabolic effect of the gene defect is exerted during 

25 gestation in utero or during postnatal life. However, the consequences of 

gene-teratogen interactions in human pedigrees have not been extensively explored, 
especially the consequences for the use of linkage mapping to identify an unknown 
gene acting in utero to cause a developmental disorder. A number of common human 
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developmental disorders have been shown to have a genetic component to their 
etiology. However, for certain developmental disorders, the mode of inheritance has 
been difficult to determine and linkage studies have met with unexpected difficulties 
or have achieved limited success. These developmental disorders include spina bifida 
5 cystica [Chatkupt, Am J Med Genet, 44:508-512 (1992)], Tourette's syndrome & 
related disorders, e.g. obsessive-compulsive disorder and chronic multiple tics 
syndrome [Pauls, Adv Neurol 58:151-157 (1992); McMahon et al, Adv Neurol 
58:159-165 (1992); Heutink et al, Am J Hum Genet, 57:465-473 (1995); Grice et al, 
Am J Hum Genet, 59:644-652 (1996)], learning disorders, including dyslexia [Lewis, 
10 et al, Behav Genet, 23:291-297 (1993); Pennington, J Child Neurol WSuppl, 1:S69- 
S77 (1995)], conduct disorder [Lombroso et al, J Am Acad Child Adolesc Psychiatry, 
33:921-938 (1994)], attention-deficit hyperactivity disorder [Lombroso et al, J Am 
Acad Child Adolesc Psychiatry, 33:921-938 (1994)], bipolar illness [Baron, Acta 
Psychiatr Scand, 92:81-86 (1995); Benjamin and Gershon, Biol Psychiatry, 40:313- 
15 316 (1996); Risch and Botstein, Nature Genet, 12:351-353 (1996); Jamison and 
Mclnnis, Nature Med, 2:521-522 (1996); Morell, Science, 272:31-32 (1996)], 
schizophrenia [Owen, Psychol Med, 22:289-293 (1992); Cloninger,^w J Med Genet, 
54:83-92 (1994); Lander and Kruglyak, Nature Genet, 11:241-247 (1995); Baron, 
Acta Psychiatr Scand, 92:81-86 (1995); Benjamin and Gershon, Biol Psychiatry, 
20 40:313-316 (1996); Baron, Am J Med Genet, 67:121-123 (1996)], autism [Lombroso 
et al, J Am Acad Child Adolesc Psychiatry, 33:921-938 (1994)], and 
obsessive-compulsive disorder in adults [Lombroso et al, J Am Acad Child Adolesc 
Psychiatry, 33:921-938 (1994)]. A recent article [Moldin, Nature Genet 17:127-129 
(1997)] has reviewed "The maddening hunt for madness genes." 

25 The present model addresses the question of the mode of inheritance of certain 
developmental disorders and proposes the "gene-teratogen model." The model 
suggests that the mode of inheritance of genes acting prenatally may in some cases be 
fundamentally different from that of genes acting postnatally. Even the same gene 
acting prenatally may produce a different disorder from that gene acting postnatally. 



WO 00/71754 



PCT/US00/14354 



40 

developmental disorders have been shown to have a genetic component to their 
etiology. However, for certain developmental disorders, the mode of inheritance has 
been difficult to determine and linkage studies have met with unexpected difficulties 
or have achieved limited success. These developmental disorders include spina bifida 
5 cystica [Chatkupt, Am J Med Genet, 44:508-5 12 (1992)], Tourette's syndrome & 
related disorders, e.g. obsessive-compulsive disorder and chronic multiple tics 
syndrome [Pauls, Adv Neurol 58:151-157 (1992); McMahon et al, Adv Neurol 
58:159-165 (1992); Heutink et al, Am J Hum Genet, 57:465-473 (1995); Grice et al, * 
Am J Hum Genet. 59:644-652 (1996)], learning disorders, including dyslexia [Lewis, 
10 et al, Behav Genet, 23:291-297 (1993); Pennington, J Child Neurol WSuppl 1:S69- 
S77 (1995)], conduct disorder [Lombroso et al, J Am Acad Child Adolesc Psychiatry, 
33:921-938 (1994)], attention-deficit hyperactivity disorder [Lombroso et al, J Am 
Acad Child Adolesc Psychiatry, 33:921-938 (1994)], bipolar illness [Baron, Acta 
Psychiatr Scand, 92:81-86 (1995); Benjamin and Gershon, Biol Psychiatry, 40:313- 
15 316 (1996); Risch and Botstein, Nature Genet, 12:351-353 (1996); Jamison and 
Mclnnis, Nature Med, 2:521-522 (1996); Morell, Science, 272:31-32 (1 996)], 
schizophrenia [Owen, Psychol Med, 22:289-293 (1992); Cloninger, Am J Med Genet, 
54:83-92 (1994); Lander and Kruglyak, Nature Genet, 11:241-247 (1995); Baron, 
Acta Psychiatr Scand, 92:81-86 (1995); Benjamin and Gershon, Biol Psychiatry, 
20 40:313-316 (1996); Baron, Am J Med Genet, 67:121-123 (1996)], autism [Lombroso 
et al, J Am Acad Child Adolesc Psychiatry, 33:921-938 (1994)], and 
obsessive-compulsive disorder in adults [Lombroso et al, J Am Acad Child Adolesc 
Psychiatry, 33:921-938 (1994)]. A recent article [Moldin, Nature Genet. 17:127-129 
(1997)] has reviewed "The maddening hunt for madness genes." 

25 The present model addresses the question of the mode of inheritance of certain 
developmental disorders and proposes the "gene-teratogen model." The model 
suggests that the mode of inheritance of genes acting prenatally may in some cases be 
fundamentally different from that of genes acting postnatally. Even the same gene 
acting prenatally may produce a different disorder from that gene acting postnatally. 



WO 00/71754 



PCT/USOO/14354 



41 

The inheritance pattern in the gene-teratogen model is simple, but from the 
perspective of the patient with the developmental disorder is neither dominant nor 
recessive. Some disorders regarded as multifactorial, polygenic, or oligogenic may 
have this mode of inheritance. In the gene-teratogen model, genetically determined 
5 teratogen production by the mother during pregnancy damages the fetus producing 
the abnormal phenotype of a developmental disorder. The model is illustrated with 
two types of loci, 1 . a teratogenic locus acting in the mother, and 2. a modifying or 
specificity locus acting in the fetus. Damage by the teratogen is influenced also by 
environmental factors. The model is interesting because it is simple and because 
10 teratogenic loci will be difficult to locate by parametric or non-parametric linkage 
mapping techniques due to misspecification of the affection status of both mother and 
affected children. A study design is suggested for identifying teratogenic loci. An 
example of the gene-teratogen model is the major intrauterine effect seen in offspring 
of phenylketonuric mothers. Certain developmental disorders whose mode of 
1 5 inheritance has been difficult to determine or whose genetic factors have been 

difficult to locate are candidates for the gene-teratogen model, including spina bifida 
cystica, Tourette's syndrome, learning disorders including dyslexia, conduct disorder, 
attention-deficit hyperactivity disorder, bipolar illness, schizophrenia, autism, and 
obsessive-compulsive disorder. 

20 The Gene Teratogen Model 

The model is described in Table 1 using two kinds of loci: a "teratogenic" locus and a 
"modifying" or "specificity" locus. The gene-teratogen model requires a teratogenic 
locus. One or more modifying or specificity loci may or may not be present. Also, 
two types of phenotypes are defined: 1. the teratogen-induced phenotype; and 2. the 
25 teratogenic phenotype, i.e., the phenotype of a mother that produces a teratogenic 
effect during pregnancy. The two phenotypes are different for the teratogenic locus 
but are identical for the modifying or specificity loci. 
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TABLE 1 

DIAGRAM OF THE GENE-TERATOGEN MODEL 



ijranaparenis . 


Maternal 
Grandmother 
AabbCCdd 


Maternal 
Grandfather 
AaBbCcdd 


Paternal 
Grandmother 
AAbbCcDd 


Paternal 
Grandfather 
AAbbCCdd 


Parents: 


Mother 
aaBbCcdd 


Father 
AAbbCcDd 


Child: 


Child (fetus) with developmental disorder 
AabbccDd 


locus A: 


teratogenic locus, recessive, acting in the mother to cause 
intrauterine teratogenic damage to the fetus. 


locus B: 


teratogenic locus, dominant, acting in the mother to cause 
intrauterine teratogenic damage to the fetus. 


locus C: 


modifying or specificity locus, recessive, acting in the fetus. 


locus D: 


modifying or specificity locus, dominant, acting in the fetus. 



10 The teratogenic locus may be dominant (locus A) or recessive (locus B). This locus 
acts in the mother during pregnancy to cause an intrauterine teratogenic effect in the 
fetus. The teratogenic effect may result from the production of an endogenous 
teratogen, from potentiation of an exogenous teratogen, from a metabolic deprivation 
or imbalance or from some other mechanism. Only one teratogenic locus is required; 

15 both locus A and locus B are shown on the same diagram for simplicity. A specificity 
or modifying locus may be dominant (locus C) or recessive (locus D). Such a locus 
acts during pregnancy or after to modify the extent of the developmental damage 
done by the teratogenic locus or even to prevent or repair the damage. For example, 
for a teratogen acting at a certain time in development, locus C or D may determine 

20 whether brain or kidney is damaged, which structures of the brain are damaged, or 
whether damage occurs at all. 

1. Locus A, recessive teratogenic locus, acting in the mother: The child is the patient 
with the abnormal phenotype of a specific developmental disorder, while mother, 
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done by the teratogenic locus or even to prevent or repair the damage. For example, 
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20 whether brain or kidney is damaged, which structures of the brain are damaged, or 
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/. Locus A, recessive teratogenic locus, acting in the mother: The child is the patient 
with the abnormal phenotype of a specific developmental disorder, while mother, 
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father, and grandparents do not have the abnormal phenotype of that disorder (Table 
1 ). Locus A acts in the mother during pregnancy causing her to produce the 
teratogenic effect that damages the developing fetus leading to the developmental 
disorder either in the fetus or postnatally in the child or adult. Since this locus is 

5 recessive in action, the mother, a homozygote (aa) for the disease allele, is the genetic 
"patient." Her abnormal phenotype, the "teratogenic phenotype", is the trait of 
producing the teratogenic effect during pregnancy. Her fetus, damaged by the 
teratogenic effect in utero, does develop the teratogen-induced phenotype. However, 
the fetus is only a heterozygote (Aa) at locus A and thus lacks both the abnormal 

10 homozygous genotype at locus A and the abnormal teratogenic phenotype; e.g., if the 
fetus is a daughter, she will not produce the teratogenic effect later during pregnancy. 
Thus, the fetus is affected with the developmental disorder but is not the genetic 
"patient." Locus A, acting through a teratogenic effect, cannot be the only etiological 
factor for the developmental disorder. If it were, then all pregnancies of an aa mother 

1 5 would have the teratogen-induced phenotype which is not the case. Environmental 
and/or other genetic factors, are required. An aa father will have the abnormal 
genotype, but not the abnormal teratogenic phenotype because he could never become 
pregnant. 

2. Locus B, dominant teratogenic locus acting in the mother: The situation is the 
20 same as for locus A except that locus B is dominant in action (Table 1). The mother 

has the abnormal genotype, Bb, and the abnormal teratogenic phenotype. The fetus 
has the teratogen-induced phenotype but in the instance shown (Table 1) has neither 
the abnormal genotype, the teratogenic phenotype, nor even a copy of the disease 
allele. The maternal grandfather shown (Table 1) has the abnormal genotype, Bb, but 
25 does not have the teratogenic phenotype because he could never become pregnant. 

3. Environmental effects: The teratogenic effect is modified by environmental 
factors, e.g. maternal dietary factors, infection, or ingestion of teratogen. These 
environmental factors may interact with locus A or B or may act independently. From 
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the perspective of the fetus later to develop the developmental disorder 
(teratogen-induced phenotype), intrauterine teratogenic is an environmental not a 
genetic effect. 

4. Modifying or Specificity Loci Acting in the Fetus, Loci C & D: These loci may 
5 interact with the teratogenic locus or the environmental factors to increase or decrease 
their effect, or alternatively could act independently. Such genetic factors may be 
recessive (locus C) or dominant (locus D). Genotypes and phenotypes of locus C and 
D behave conventionally with respect to the developmental disorder. For locus C and 
D, the fetus is with the developmental disorder is now the genetic "patient". Maternal 
10 teratogenic in utero is an environmental effect. It is thus possible that the same gene 
locus could act in part as a teratogenic locus and in part as a modifying or specificity 
locus. 

DISCUSSION 

The Example of Phenylketonuria: An example of the gene-teratogen model is the 
1 5 major intrauterine effect in maternal phenylketonuria (PKU). Phenylketonuria itself 
is a recessive postnatal disorder. Untreated homozygous PKU mothers and fathers 
both have elevated blood phenylalanine (hyperphenylalaninemia). However, 
heterozygous offspring of untreated PKU mothers (but not fathers) have an abnormal 
phenotype.[Koch et al f Acta Paediatr Suppl, 407:1 1 1-1 19 (1994); Allen et al. Acta 
20 Paediatr Suppl 407:83-85 (1994); Abadie et al, Archives Pediatr, 3:489-486 

(1996)]. Thus the elevated blood phenylalanine or other metabolite(s) in the mother 
acts as a teratogen for the fetus. Note that the fetus of an untreated phenylketonuric 
mother does not have the phenotype of PKU (the "teratogenic phenotype"), but has a 
different phenotype (the "teratogen-induced phenotype"). 
25 Phenylketonurics [Menkes, Textbook of Child Neurology, Lea & Febiger, 
Philadelphia (1990)] are normal at birth and develop a progressive disorder 
postnatally characterized by vomiting, eczema, seizures (infantile spasms with 
hypsarrythmia on electroencephalography), and mental retardation. The fetus of an 
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untreated phenylketonuria mother [Menkes, Textbook of Child Neurology, Lea & 
Febiger, Philadelphia (1990)] has a congenital non-progressive disorder of fetal 
origin characterized by microcephaly, abnormal facies, mental retardation, congenital 
heart disease, and prenatal and postnatal growth retardation. The PKU phenotype is a 

5 postnatal degenerative disorder; the phenotype of the PKU intrauterine effect is a 
developmental disorder. The teratogenic effect is not dependent upon the fetal 
genotype, although the fetus is an obligate heterozygote since the mother is a 
homozygote for phenylketonuria and the father (usually) has the normal genotype. 
Thus, in phenylketonuria, a mutation at the same gene locus causes two distinct 

1 0 disorders depending upon whether the period of abnormal gene 

action is prenatal or postnatal. A fetus with the abnormal homozygous genotype who 
is carried by a heterozygous mother is protected in utero, but develops PKU 
postnatally. A heterozygous fetus carried by a mother with the abnormal 
homozygous genotype is damaged in utero when the mother's genotype predominates, 

1 5 but is protected from PKU postnatally by its own genotype. 

An Example from Studies in Inbred Mice: Finnell and ChomofI [Gene-teratagen 
interactions: an approach to understanding the metabolic basis of birth defects, In 
Pharmacokinetics in Teratogenesis,Vol. 11:97-109 Experimental Aspects In Vivo and 
In Vitro, CRC Press, Inc, Boca Ratan, Fl. (1987)] have reviewed a group of elegant 

20 experiments in inbred mice documenting that differences in susceptibility to 
exogenous teratogens can be regarded as a genetic trait that is determined by 
susceptibility or liability genes of either the maternal or fetal genotype [Finnell and 
Chernoff, Gene-teratagen interactions: an approach to understanding the metabolic 
basis of birth defects, In Pharmacokinetics in Teratogenesis,Vol. 11:97-109 

25 Experimental Aspects In Vivo and In Vitro, CRC Press, Inc, Boca Ratan, Fl. (1987)]; 
Finnell et al, Am J. Med. Genet 70:303-311 (1997); Bennett et al t Epilepsia 38:415- 
423 (1997)]. For example, sensitivity to acetazolamine-induced ectrodactyly is 
determined by the presence of three genes, and the fetus must be homozygous for the 
recessive allele at all three loci in order to express the malformation. However, the 
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inbred mouse models used do not mirror the human situation in at least three respects. 
First, the human population is an outbred population compared to these inbred mouse 
models. Consequently, the relevant genotypes may be highly variable among 
members of different families. Second, the inbred mouse experiments address the 
5 question of exogenous rather than endogenous teratogens. Third, the inbred mouse 
studies rely upon known or candidate susceptibility loci, whereas in humans, the 
problem has been to locate and identify disease unknown loci largely by using linkage 
mapping techniques. 

Implications for Linkage Mapping: 

10 Teratogenic Locus (LocusA or B): The gene-teratogen model has major implications 
for linkage mapping done with either parametric or non-parametric methods. The 
problem for both methods is incorrect assignment of affection status. In the lod score 
method, a genetic model of the disease is constructed and an affection status is 
assigned to each member of the pedigree. If the genetic model specified is wrong, the 

15 linkage results may be falsely positive or falsely negative [Terwilliger and Ott, 

Handbook of Human Genetic Linkage, Johns Hopkins Univ. Pr., Baltimore (1994)]. 

In developmental disorders resulting from the gene-teratogen model, the phenotype 
assignment for lod score analysis will be incorrect. The patient with the 
developmental disorder will be assigned the affected phenotype, whereas the patient 

20 is actually affected only for the teratogen-induced phenotype, but is unaffected for the 
teratogenic phenotype. Likewise, the mother will be assigned the unaffected 
phenotype for linkage analysis. Actually, she is unaffected only for the 
teratogen-induced phenotype, but is affected for the teratogenic phenotype. Lod 
scores should increase when phenotype assignments have been corrected. However, 

25 apparently dominant inheritance may in fact turn out to be pseudodominant if the 
mutant allele is common in the population. For non-parametric analysis, a similar 
misassignment occurs. In the case of affected sib-pairs, the affected sibs will be 
assigned the affected phenotype. Actually, the sibs are affected only for the 
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teratogen-induced phenotype, but are unaffected for the teratogenic phenotype. The 
mother will be assigned the unaffected or unknown phenotype. Actually, she is 
unaffected only for the teratogen-induced phenotype but is affected for the teratogenic 
phenotype. Thus, the "affected sib-pair" families are likely to turn out to contain only 
- 5 a single sporadic case, since the only individual in the kindred affected with the 
teratogenic phenotype will be the mother. 

For the transmission/disequilibrium test (TDT) [Spielman et al, Am J Hum Genet, 
52:506-516 (1993); Ewens and Spielman, Am J Hum Genet, 57:455-464 (1995)] the 
patient with the developmental disorder will be assigned the affected phenotype. 

10 Actually, the patient will be affected only for the teratogen-induced phenotype but 
will be unaffected for the teratogenic phenotype. The mother will be assigned the 
unaffected or unknown phenotype. Actually, she is unaffected only for the 
teratogen-induced phenotype but is affected for the teratogenic phenotype. The 
expectation of TDT is that alleles of a linked locus will show distortion from random 

1 5 transmission from mother (or father) to the patient. Since the patient is unaffected for 
the teratogenic phenotype, no transmission distortion from mother (or father) to child 
will be observed. Transmission distortion for alleles of a teratogenic locus will in fact 
occur from the mother's parents to the mother, the actual patient for the teratogenic 
phenotype. But this will not be looked for because the phenotypes have been wrongly 

20 assigned. In addition, grandparents of the patients with the developmental disorder 
have probably not had DNA collected. Therefore, for the TDT, negative results may 
occur for disease alleles of a teratogenic locus because incorrect phenotype 
assignments will have been made. When correct phenotype assignments have been 
made, transmission distortion to the mother from her parents should be expected for 

25 disease alleles of a teratogenic locus. Analogous misassignments are made in allelic 
association and haplotype relative-risk analyses [Falk and Rubinstein, Ann Hu, Genet, 
51:227-233 (1987); Terwilliger and Ott, Hum Hered, 42:337-346 (1992); Thomson, 
Am J Hum Genet, 57:487-498 (1995)]. 
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Modifying or Specificity Loci (Locus C and/or D) : Since these loci behave in a 
conventional fashion, the phenotype assignments will be correct. Consequently, 
genes identified by conventional parametric or non-parametric linkage studies are 
likely to be modifying or specificity loci. An important question for linkage mapping 

5 is the relative contribution to the abnormal phenotype of the developmental disorder 
made by the teratogenic locus versus that of a modifying or specificity locus. If the 
effect of a teratogenic locus is small, then loci identified by conventional linkage 
studies will be specificity or modifying loci and the mode of inheritance will be 
Mendelian or multifactorial. If a teratogenic locus makes a major contribution to 

10 phenotype, then linkage mapping studies will not give a consistent answer and the 
mode of inheritance will be difficult to determine. 

The presence of a teratogenic locus may be suspected if the maternal contribution to 
phenotype is different from or greater than the paternal contribution. For example, 
the mother's relatives of spina bifida infants more frequently have affected children 

1 5 than the father's relatives. Suggested explanations for this observation have been 
mitochondrial inheritance, maternal effect, or genomic imprinting [Chatkupt, Am J 
Med Genet, 44:508-512 (1992)]. The operation of a teratogenic locus is another 
explanation and is itself a form of maternal effect. For a recessive teratogenic locus, 
the mother's sisters would be at greatest risk of having offspring with the 

20 teratogen-induced phenotype. 

Implications for Definition of Phenotype: All the pregnancies of a mother with the 
teratogenic phenotype are at risk for the developmental disorder, the 
teratogen-induced phenotype. Yet only a few of the fetuses will be affected by the 
developmental disorder because of the action of environmental factors and/or the 
25 modifying or specificity loci. The action of the environmental factors is fully 
quantitative: depending upon the amplitude of the environmental effect, a mild, 
moderate, or severe teratogen-induced phenotype may result. In addition, the 
environmental factor may act at different times in fetal development producing 
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qualitatively different phenotypes. Thus, quantitatively or qualitatively different 
teratogen-induced phenotypes may result from pregnancies of the same mother with 
the teratogenic phenotype. In addition, the action of the modifying or specificity loci 
may produce quantitatively or qualitatively different phenotypes in offspring of the 

5 same couple. Such different phenotypes may be diagnostically classified as different 
disorders. This may complicate attempts at associating specific loci with a 
specific teratogen-induced phenotype. All of the teratogen-induced phenotypes 
resulting from pregnancies of a mother with the teratogenic phenotype modified only 
by environmental factors are genetically indistinguishable. However, such 

1 0 teratogen-induced phenotypes affected also by the various modifying or specificity 
loci segregating among the offspring of a single couple are only partially genetically 
related. 

Methods to Identify Teratogenic Loci: One effective approach to finding a putative 
teratogenic locus is to carry out non-parametric linkage studies of families consisting 

15 of a patient affected with the developmental disorder, the patients two (unaffected) 
parents, and the patient's four (unaffected) grandparents (Table 1). In such a family, 
the mother is the genetic patient but the other family members are not. Now, the 
mother's nuclear family (the mother and her parents) is compared with the father's 
nuclear family (the father and his parents). In a haplotype relative risk study, the 

20 disease allele(s) of the teratogenic locus will occur more frequently in the mother 
compared with other alleles of her parents; the disease allele(s) of the teratogenic 
locus will not occur more frequently in the father compared with other alleles of his 
parents. In a transmission/disequilibrium test, transmission distortion will be seen for 
the disease allele(s) of a teratogenic locus in the mother's nuclear family but not in the 

25 father's nuclear family. In an allelic association study, the disease allele will occur 
more frequently in mothers, patients (with the developmental disorder), and patient's 
sibs (both affected and unaffected) than in unrelated control individuals. Disease 
allele frequency in fathers will not be distinguishable from that in control individuals. 
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Certain developmental disorders with a genetic component to etiology, whose mode 
of inheritance has been difficult to determine or whose genetic factors have been 
difficult to locate, including those mentioned earlier, are candidates for the 
gene-teratogen model. 

5 MODEL 2 : 

DNA POL YMORPHISM-DIET-COF ACTOR-DEVELOPMENT HYPOTHESIS 
FOR SCHIZOPHRENIA AND OTHER DEVELOPMENTAL DISORDERS 

Folate metabolism is complex. At least 30 gene loci are involved in absorption, 
transport, and metabolism of folate, and these are regulated by additional gene loci. 
10 Any of these is potentially a genetic risk factor for schizophrenia, although MTHFR 
and DHFR are particularly good candidates. Likewise, genes encoding proteins 
involved in the pathways of other vitamin-cofactors may be genetic risk factors. 

Two cofactors that may be of particular potential importance are cobalamin and 
pyridoxine. Cobalamin is relevant because its metabolism is closely intertwined with 

1 5 that of folate. For example, cobalamin is required for the activity of methionine 
synthase (MTR), a folate-related enzyme. Decreased cobalamin can affect folate 
metabolism through the folate trap. Pyridoxine is relevant because the 
pyridoxine-dependent enzyme cystathionine beta-synthase (CBS), along with the 
cobalamin-dependent enzyme MTR and folate pathways including MTHFR and 

20 DHFR all participate in catabolism of homocysteine, an amino acid that is suspected 
of being a teratogen during pregnancy. Also, kynureninase, an important enzyme 
affecting niacin metabolism and serotonin synthesis is pyridoxine-dependent. 
Therefore, mutations of the genes encoding such proteins, especially common 
polymorphisms, could play a role in the cause of schizophrenia. 

25 Since folate, cobalamin, and pyridoxine are all dietary constituents, the dietary 
content of these cofactors could be lead to an "environmental" generation of a risk 
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factor for schizophrenia. In addition genes encoding proteins involved in folate, 
cobalamin, and pyridoxine metabolism and catabolism could be genetic risk factors 
for schizophrenia. Thus, the cofactors and the proteins involved in pathways relevant 
to these cofactors can potentially have either or both environmental and genetic 
5 effects on the susceptibility of an individual on schizophrenia. 

Since the genetic aspect of schizophrenia differs so profoundly from other disorders 
which have been identified by linkage mapping techniques, it is clear that a new 
model for the genetic connection to schizophrenia is required. Therefore, the DNA 
Polymorphism-Diet-Cofactor-Development (DDCD) hypothesis, is disclosed herein. 

10 The DDCD hypothesises that interacting genetic and environmental factors affecting 
the metabolism of folate, cobalamin, or pyridoxine or all of these, play a role in the 
etiology of schizophrenia. The genetic effect results from the aggregate effect of 
multiple mutations that individually, for the most part, have small effects on folate-, 
cobalamin- or pyridoxine-related genes, some of which will be common in the 

15 population, and can act in utero. Environmental factors include dietary folate and 
cobalamin and pyridoxine. If schizophrenia results from mild deficiency during fetal 
development of dietary folate, cobalamin, or pyridoxine potentiated by mild genetic 
susceptibility mutations of genes related to these cofactors and by pregnancy, then 
this would be difficult to document by linkage mapping techniques. An example of 

20 interaction of genetic and environmental factors is that genetic factors are important 
for incorporating dietary folate; the enzyme dihydrofolate reductase is required for 
conversion of dietary folate to folinic acid thus allowing dietary folate to enter the 
body's metabolic pathways. Another example is that folate and cobalamin 
requirements increase during pregnancy; thus pregnancy could potentiate the effects 

25 of mild genetic defects of mother, fetus, or both. Deficiencies of a vitamin are often 
part of a broader dietary deficiency affecting multiple nutrients in addition to the 
vitamin being measured. 
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Locus Heterogeneity : The metabolic pathways of folate, cobalamin, and pyridoxine 
are complex and related to each other. Multiple gene loci code for the enzymes and 
transport proteins are required (Tables 2-7). Thus, a defect of folate, cobalamin, or 
pyridoxine metabolism could result from the aggregate effect of multiple mutations 
5 each of relatively small effect interacting with environmental factors. Different 

individuals might have different combinations of mutations. Such a metabolic defect 
would be difficult to detect by linkage mapping techniques because of locus 
heterogeneity. 

Alternatively, even if one genetic defect were sufficient to make an individual more 
10 susceptible to having schizophrenic offspring, for example, because of the large 
number of potential genetic factors, and the corresponding importance of 
environmental factors, elucidation of such an individual genetic defect would still be 
difficult unless, of course, the genetic defect caused a major effect. The difficulty in 
elucidating an individual genetic defect is magnified when the genetic factor acts in 
15 the mother, and not in the schizophrenic patient. 

High Disease Allele Frequency : Numerous mutational variants of folate and 
cobalamin genes are known. Some of these have functional significance and in 
addition are sufficiently common in a given population to be regarded as genetic 
polymorphisms. However, these common alleles are unlikely to have a major 

20 harmful effect by themselves, for if they did they would become uncommon in the 
population in the absence of selection effects, and would likely appear as Mendelian 
disorders. Thus, the folate, cobalamin, or pyridoxine disease alleles related to 
schizophrenia would appear to be more likely those of minor deleterious effect or 
those with harmful effect only in the presence of environmental deficiencies or 

25 pregnancy. Such disease genes of high population frequency will be difficult to 
detect by linkage mapping methods because high disease allele frequency decreases 
the power of linkage studies [Terwilliger and Ott, Handbook of Human Genetic 
Linkage, John Hopkins Univ. Press, Baltimore, (1994)]. 
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Developmental Genes: Folate, cobalamin, and pyridoxine defects act prenatally as 
well as postnatally. Folate, cobalamin, and pyridoxine metabolism are crucial for 
DNA synthesis and cell division, which are of disproportionate importance during 
brain development. Some defects of folate, cobalamin, or pyridoxine metabolism 

5 elevate blood homocysteine, a toxic and potentially teratogenic substance. Genes 
acting in the mother to damage the developing fetus, e.g. via the gene-teratogen 
model (Model 1, above), have a mode of inheritance that is neither dominant nor 
recessive with respect to the fetus. Attempts to assign a mode of inheritance in this 
situation will be unsatisfactory because affection status would be incorrectly assigned. 

1 0 The mode of inheritance of a developmental disorder resulting from a teratogenic 
locus would be regarded as either multifactorial or unknown. This is the situation 
with schizophrenia whose mode of inheritance is unknown. Use of an incorrect 
genetic model decreases the power of a linkage studies [Terwilliger and Ott, 
Handbook of Human Genetic Linkage, John Hopkins Univ. Press, Baltimore, 

15 (1994)]. 

Genes of Folate Metabolism: Folate metabolism is extremely complex [Rosenblatt, 
In: The Metabolic and Molecular Bases of Inherited Disease, Scriver et al (eds), New 
York: McGraw-Hill, pp. 31 11-3128 (1995); Muddef al, In: The Metabolic and 
Molecular Bases of Inherited Disease, Scriver et al. (eds), New York: McGraw-Hill 
20 pp. 1279-1327 (1995)]. At least 30 gene loci (Table 2) have been identified as 
folate-related. These contribute to folate mediated 1 -carbon transfer reactions, 
binding, transport and metabolism of folate, and other functions. A number of these 
have been cloned and localized to a chromosomal region (Table 3). 
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TABLE 2 

FOLATE-RELATED GENES/ENZYMES/TRANSPORTERS" 





Folate-Related Genes/Enzymes/Tranporters 8 


SEQ ID NO: 




methylenetetrahydrofolate reductase, MTHFR, MIM 236250 


1 


5 


methionine synthase (methyltetrahydrofolate:L-homocysteine 
S-methyltransferase), MTR, MIM 1 56570 


2 




dihydrofolate reductase, DHFR, MIM 1 26060 


3 




folylpolyglutamate synthase, FPGS, MIM 136510 


4 


10 


folate receptor 1, folate receptor alpha (FOLR1, adult; FR-alpha), 
MIM 136430 


5 




folate receptor 2, folate receptor beta (FOLR2, fetal; FR-beta), 
MIM 136425 (a.a.) 


6 




folate receptor 2-like (FOLR2L, fetal-like), MIM-none 


- 




folate receptor gamma (FR-garnrna), MIM 602469 


1 


15 


serine hydroxymethyltransferase 1 , SHMT1 , MIM 1 82 1 44 


8 




methylenetetrahydrofolate dehydrogenase, methenyltetrahydrofolate 
cyclohydrolase, 10-formyltetrahydrofolate synthetase (trifunctional enzyme, 
MTHFD), MIM 172460 


9 




serine hydroxymethyltransferase 2, SHMT2, MIM 1 38450 


10 


20 


thymidylate synthase, TYMS, MIM 1 88350 


11 




GAR (5-phosphoribosylglycineamide) transformylase, GART, MIM 138440 


12 




reduced folate carrier- 1, RFC1. Probably identical to micromolar membrane 
transport protein, intestinal folate carrier- 1 (IFC1), and neutral folate transport 
protein. MIM 600424 


13 


25 


cystathionine beta-synthase, CBS, MIM 236200 


14 




AICAR (5-phosphoribosyl-5-aminoimidazole-4-carboxamide) transformylase 


15 




glutamate forrrununotransferase; MIM 229100 






forminotetrahydrofolate cyclodeaminase 






5, 10-methenyltetrahydrofolate synthetase 


16 


30 


10-formyltetrahydrofolate dehydrogenase, Mim 600249 
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TABLE 2 

FOLATE-RELATEDGENES/ENZYMES/TRANSPORTERS" 



Folate-Related Genes/Erizymes/Tranporters* 


SEQ ID NO: 




methylenetetrahydrofolate reductase, MTHFR, MIM 236250 


1 


5 


methionine synthase (methyltetrahydrofolate:L-homocysteine 
S-methyltransferase), MTR, MIM 156570 


2 




dihydrofolate reductase, DHFR, MIM 126060 


3 




folylpolyglutamate synthase, FPGS, MIM 1 365 1 0 


4 


10 


folate receptor 1, folate receptor alpha (FOLR1, adult; FR-alpha), 
MIM 136430 


5 




folate receptor 2, folate receptor beta (FOLR2, fetal; FR-beta), 
MIM 136425 (a.a.) 


6 




folate receptor 2-like (FOLR2L, fetal-like), MIM-none 






folate receptor gamma (FR-gamma), MIM 602469 


7 


15 


serine hydroxymethyitransferase 1, SHMT1 , MIM 1 82 144 


8 




methylenetetrahydrofolate dehydrogenase, methenyltetrahydrofolate 
cyclohydrolase, 10-formyltetrahydrofolate synthetase (trifunctional enzyme, 
MTHFD), MIM 172460 


9 




serine hydroxymethyitransferase 2, SHMT2, MIM 1 38450 


10 


20 


thymidylate synthase, TYMS, MIM 1 88350 


11 




GAR (5-phosphoribosylglycineamide) transformylase, GART, MIM 138440 


12 




reduced folate carrier- 1, RFC1. Probably identical to micromolar membrane 
transport protein, intestinal folate carrier- 1 (IFC1), and neutral folate transport 
protein. MIM 600424 


13 


25 


cystathionine beta-synthase, CBS, MIM 236200 


14 




AICAR (5-phosphoribosyl-5-aminoimidazole-4-carboxamide) transformylase 


15 




glutamate formiminotransferase, MIM 229100 






forminotetrahydrofolate cyclodeaminase 






5, 10-methenyltetrahydrofolate synthetase 


16 


30 


1 0-formyltetrahydrofolate dehydrogenase, Mim 600249 
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Folate-Related Genes/Enzymes/Tranporters* 


SEQ ID NO: 


glycine cleavage pathway (SHMT plus three enzymes): 
MIM 238331 
Gly-decaxboxylase MIM 238300 
H-Protein MIM 238330 
T-Protein MIM 238310 


17 
18 
19 


cblG (affects function of MTR), MIM 250940 




methionine adenosyltransferase 1, MAT1A, (ATP:L-methionine S- 
adenosyltransferase), MIM 250850 


20 


pteroyl polyglutamate hydrolase ("conjugase"), form 1 




pteroyl polyglutamate hydrolase ("conjugase"), form 2 




NAD- dependent enzyme methylene tetrahydrofolate dehydrogenase 
cyclohydrolase (a.a.) 


21 


methionine adenosyltransferase 2, MAT2A, MIM 601468 


22 


5-methyltetrahydrofolate- homocysteine methyltransferase reductase (MTRR) 
MIM 602568; #Variant in MTRR linked to cblE MIM 236270 


23 


methy ltran fera ses 




S-adenosylmethionine decarboxylase, MIM 180980 


24 


decarboxylated S-adenosylmethionine:putrescine propylaminotransferase or 
spermidine synthetase (a.a.) 


25 


S-adenosylhomocysteine hydrolase, , MIM 1 80960 


26 


betaine-homocysteine methyltransferase dimethylthetin-homocysteine 
methyltransferase 


27 


garnma-cystathionase (L-cystationine cysteine-lyase (deaminating)), 
MIM 602888 


28 


folic acid transport protein, MIM 229050 




DHFR (exon 6 and 3 'flanking region) 


30 


kynureninase 


35 


human DHFR, exons 1 and 2 [Chen et al y J. Biol Chem. 259:3933-3943 
(1984)] 


36 


"listed with alternate names, abbreviations, and MIM numbers; 

#cblE is a phenotype for a particular group of disorders of folate/cobalamin metabolism, 
(a.a.) indicates the amino acid sequence 
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SEQ ID NO: 
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25 


20 
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25 
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36 
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LOCALIZED GENE LOCI RELATED TO FOLATE METABOLISM 





Gene/enzyme/transport protein 


Location 


References 




MTHFR 


lp36.3 


Goyette et al ,(1994); 
> 


5 


MTR 


lq43 


Cook and Hamerton, (1979); 

Mellman et al., (1979) 
** 




DHFR 


5ql 1.2-13.2 


Weiffenbachetal.,(1991) , 
Gilliam et al. (1989b) 

* ** 

> 




FPGS 


9cen-q34 


Jones and Kao (1984): 
Walters al. (1992) 




MAT 


10q22 


** 




FR 


Ilql3.3-ql4.1 
llql3.3-l 13.5 


Laceyera/. (1989), 
Ragoussis et al, (1992); 
Ratnumef a/. (1989); 
Walters al (1992); * 
Ragoussis et al, (1992), ** 


10 


SHMT2 


12ql2-ql4 
12ql3 


Gan-ow^a/.,(1993); 

Law and Kao, (1979)* 
** 




MTHFD 


14q24 


Rozen et a/., (1989), 
Jones et al (1981), *, ** 




LCCL 


16pter-qter 


* ** 




SHMT1 


17pl 1.2 . 


Garrow et al, (1993) *, ** 




TYMS 


1 O 11 11 — 1 1 

lSpl l-3l.-pll.22 
I8pll.32 


* 

Hori et al, (1990); 
Silverman et al, (1993) 


15 


SAHH 


20cen-ql3.l 






GART 


21q22.l 


Mclnnis e/a/. (1993) 
Schild era/. (1990) 
Avrarmopoulos et al (1993) 

Goto et al (1993) 

* ** 

> 




RFC! 


2lq22.2-22.3 


Moscow e/ a/., (1995) 
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Gene/enzyme/transport protein 



Location 



References 



CBS 



21q22.3 



Munke e/ a/., (1988) 



10 



15 



20 



25 



30 



35 



notes: MTHFR=methylenetetrahydrofolate reductase. MTS=methionine synthase. 
DHFR=dihydrofolate reductase. FPGS=folylpolyglutamate synthase. 
MAT=methionine adenosyltransferase, (ATP:L-methionine S-adenosyltransferase). 
FR=folate receptor complex: FR-alpha=FOLRl=folate receptor 1, adult; FR- 
beta=FOLR2=folate receptor 2, fetal; FR-gamma; FOLR2L=folate receptor 2-like. 
SHMT2=serine hydroxymethyltransferase 2, mitochondrial. MTHFD=5, 10- 
methylenetetrahydrofolate dehydrogenase, 5, 10-methylenetetrahydro folate 
cyclohydrolase, 10-formytetrahydrofolate synthase (tri functional enzyme). 
LCCL=gamma-cystathionase (L-cystathionine cysteine-lyase (deaminating). 
SHMTl=serine hydroxymethyltransferase 1, soluble. TYMS^hymidylate 
synthetase. SAHH, S-adenosylhomocysteine hydrolase. 
GART=phosphoribosylglycineamide formyltransferase. RFCl==reduced folate 
carrier- 1 (possibly identical to IFC1, intestinal folate carrier- 1). CBS=cystathionine 
beta-synthase. Location information from GOD (*), from MIM (**). 
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Gene/enzyme/transport protein 



Location 



References 
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beta=FOLR2=folate receptor 2, fetal; FR-gamma; FOLR2L=folate receptor 2-like. 
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and Rosenberg, In: The Metabolic and Molecular Bases of Inherited Disease, 
40 Disease, Scriver et al. (eds), New York: McGraw-Hill, 3129-3149 (1995)]. At least 
15 gene loci (Table 4) have been identified as cobalamin-related. These contribute to 
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the binding, transport, and metabolism of cobalamin, and its functions. A number of 
these have been cloned and localized to a chromosomal region (5). Cobalamin 
metabolism is closely intertwined with that of folate. For example, cobalamin is 
required for the activity of MTR, a folate-related enzyme. Decreased cobalamin can 
affect folate metabolism through the folate trap [Rosenblatt, In: The Metabolic and 
Molecular Bases of Inherited Disease, Scriver et al (eds), New York: McGraw-Hill, 
pp. 3111-3128 (1995); Quadros et al, Biochem. Biophys. Res. Commun., 222:149- 
154(1996)]. 
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TABLE 4 



COBALAMIN-RELATED GENES/ENZYMES/TRANSPORTERS" 



Cobalamin-RelatedGenes/Enzymes/Tranporters 8 * 


SEQIDNO: 


(gastric) intrinsic factor, GIF, MIM-261000 (combined deficiency of 
GIF & R-binder, MIM 243320 


31 


intrinsic factor receptor, IFCR, MIM-261 100 




transcobalamin I, TCI (an R-protein, plasma), MIM 1 89905 


32 


transcobalamin III, TCIII (an R-protein, plasma), MIM-none 




other R-proteins (R-binders, cobalophylins, haptocorrins), MIM 193090 




transcobalamin II, TCII MIM 275350 


33 


transcobalamin II receptor, TCII receptor, MIM-none 




methylmalonyl Co- A mutase, MCM (MUT locus), MIM 251000 


34 


cblF, lysosomal cbl efflux, MIM 277380 




cblC, cytosolic cbl metabolism, MIM 277400 




cblD, cytosolic cbl metabolism, MIM 277410 




cblA, mitochondrial cbl reduction, (AdoCbl synthesis only), MIM 
251100 




cblB, cob(I)alamin adenosyltransferase, (AdoCbl synthesis only), MIM 
251110 




cblE, methyltransferase-associated cbl utilization, MIM 236270 




cblG, methyltransferase-associated cbl utilization, MIM 250940 




a listed with alternate names, abbreviations, and MIM numbers 
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TABLE 4 



COBALAMIN-RELATED GENES/ENZYMES/TRANSPORTERS 3 



Cobalamin-Related Genes/Enzymes/Tranporters* 


SEQ ID NO: 


(gastric) intrinsic factor, GIF, MIM-261000 (combined deficiency of 
GIF & R-binder, MIM 243320 


31 


intrinsic factor receptor, IFCR, MIM-261 100 




transcobalamin I, TCI (an R-protein, plasma), MIM 1 89905 


32 


transcobalamin III, TCIII (an R-protein, plasma), MIM-none 




other R-proteins (R-binders, cobalophylins, haptocorrins), MIM 193090 




transcobalamin II, TCII MIM 275350 


33 


transcobalamin II receptor, TCII receptor, MIM-none 




methylmalonyl Co- A mutase, MCM (MUT locus), MIM 25 1 000 


34 


cblF, lysosomal cbl efflux, MIM 277380 




cblC, cytosolic cbl metabolism, MIM 277400 




cblD, cytosolic cbl metabolism, MIM 277410 




cblA, mitochondrial cbl reduction, (AdoCbl synthesis only), MIM 
251100 




cblB, cob(I)alamin adenosyltransferase, (AdoCbl synthesis only), MIM 
251110 




cblE, methyltransferase-associated cbl utilization, MIM 236270 




cblG, methyltransferase-associated cbl utilization, MIM 250940 




"listed with alternate names, abbreviations, and MIM numbers 



WO 00/71754 PCT/US00/14354 

60 
TABLE 5 

LOCALIZED GENE LOCI RELATED TO COBALAMIN METABOLISM 





Gene/enzyme/transport protein 


Location 


References 




MCM(MUT locus) 


6p21.2-p21.1 


Qureshi etal. (1994) * 


5 


IF/GIF 


Hql2-ql3 


Hewit et al. (1991)* 




TCI (an R-protein, plasma) 


llqll-ql2.3 


Johnston et al., (1992) 
Sigal etal., (1987), * 




TCII 


22qll.2-ql3 
22ql2/13 border 


Lie/ al., (1995) 


10 


notes: MCM=methymalonyl Co-A mutase; IF/GIF=(gastric) intrinsic factor; 
TCI=transcobalmin I; TCIl=transcobalamin II. Location information from GDB (*), from 
MIM (**). 


15 


Qureshi et al., Crit. Rev. Oncol. Hematol. 17: 133-151 (1994) 

Hewit et al, Genomics 10:432-440 (1991) 

Johnston et a/.,Genomics 12:459-464 (1992) 

Sigal et al.JJ. Engl. J. Med. 317:1330-1332 (1987) 

Li et al., Biochem. Biophys. Res. Comm. 208:756-764 (1995) 



Genes of Pyridoxine Metabolism: Pyridoxine metabolism is also complex with three 
dietary forms convertible to pyridoxal phosphate [Whyte et al y Hypophosphatasia, 
In: The Metabolic and Molecular Bases of Inherited Disease, Scriver et al (eds), New 
York: McGraw-Hill pp. 4095-41 1 1 (1995)] and many pyridoxine-related and 

20 pyridoxine-dependent enzymes including decarboxylases and all aminotranferases 
(Table 6). A number of pyridoxine-related enzymes have been cloned and localized 
to a chromosomal region (Table 7). Pyridoxine metabolism is related to folate 
metabolism, especially 1-carbon transfer reactions: both serine 
hydroxymethyltransferases and the P-protein (glycine decarboxylase) of the glycine 

25 breakdown system are pyridoxine-dependent. 
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TABLE 5 



LOCALIZED GENE LOCI RELATED TO COBALAMIN METABOLISM 





Gene/enzyme/transport protein 


Location 


References 




MCM (MUT locus) 


6p21.2-p21.1 


Qureshi et aL (1994) * 


5 


IF/GIF 


Ilql2-ql3 


Hewit et aL (1991)* 




TCI (an R-protein, plasma) 


llqll-q!2.3 


Johnstone/ aL, (1992) 
Sigal etal., (1987), * 




TCII 


22qlL2-ql3 
22q 12/1 3 border 


Li et aL, (1995) 


10 


notes: MCM^methymalonyl Co-A mutase; IF/GIF=(gastric) intrinsic factor; 
TCI=transcobalmin I; TCII=transcobalamin II. Location information from GDB (*), from 
MIM (**). 


15 


Qureshi et aL, CriL Rev. Oncol. HematoL 17:133-151 (1994) 

Hewit etal, Genomics 10:432-440 (1991) 

Johnston et a/.,Genomics 12:459-464 (1992) 

Sigal et al.,N. Engl. J. Med. 317:1330-1332 (1987) 

Li et aL. Biochem. Biophys. Res. Comm. 208:756-764 (1995) 



Genes ofPyridoxine Metabolism: Pyridoxine metabolism is also complex with three 
dietary forms convertible to pyridoxal phosphate [Whyte et aL, Hypophosphatasia, 
In: The Metabolic and Molecular Bases of Inherited Disease, Scriver et aL (eds), New 
York: McGraw-Hill pp. 4095-41 1 1 (1995)] and many pyridoxine-related and 

20 pyridoxine-dependent enzymes including decarboxylases and all aminotranferases 
(Table 6). A number of pyridoxine-related enzymes have been cloned and localized 
to a chromosomal region (Table 7). Pyridoxine metabolism is related to folate 
metabolism, especially 1 -carbon transfer reactions: both serine 
hydroxymethyltransferases and the P-protein (glycine decarboxylase) of the glycine 

25 breakdown system are pyridoxine-dependent. 
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TABLE 6 

SOME PYRIDOXINE-RELATED GENES/ENZYMES/* 

1 . cystathionine beta-synthase, CBS, MIM 236200 

2. gamma-cystathionase, MIM 219500 
5 (L-cystathionine cysteine-lyase, deaminating), LCCL 

3. glycine cleavage system (GCS): glycine decarboxylase (P-protein) 

4. serine hydroxymethyltransferase 1 , SHMT1 , MIM 1 82 1 44 

5. serine hydroxymethyltransferase 2, SHMT2, MIM 1 38450 
10 6. kynureninase MIM 278600 

7. all aminotransferases, MIM 258870 

(e.g. ornithine-gamma-aminotranferases, OAT, ) 

8. decarboxylases, MIM 266100 
15 r e.g. glutamic acid decarboxylases, GAD1, GAD2, 

9. pyridoxamine(pyridoxine)-5' -phosphate oxidase MIM 603287 



a listed with alternate names, abbreviations, and MIM numbers. 
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TABLE 6 

SOME PYRIDOXINE-RELATED GENES/ENZYMES/" 

1 . cystathionine beta-synthase, CBS, MIM 236200 

2. gamma-cystathionase, MIM 219500 
5 (L-cystathionine cysteine-lyase, deaminating), LCCL 

3. glycine cleavage system (GCS): glycine decarboxylase (P-protein) 

4. serine hydroxymethyltransferase 1, SHMT1, MIM 182144 

5. serine hydroxymethyltransferase 2, SHMT2, MIM 138450 
10 6. kynureninase MIM 278600 

7. all aminotransferases, MIM 258870 

(e.g. omithine-gamma-aminotranferases, OAT, ) 

8. decarboxylases, MIM 266100 
15 e.g. glutamic acid decarboxylases, GAD1, GAD2, 

9. pyridoxamirie(pyridoxine)-5' -phosphate oxidase MIM 603287 



a listed with alternate names, abbreviations, and MIM numbers. 
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TABLE 7 

SOME LOCALIZED GENE LOCI RELATED TO PYRIDOXINE METABOLISM 



10 



Gene/enzyme 

1 . GAD2 

2 . GCS P-protein 

3 . GAD1 

4 . OAT 

5 . SHMT2 

6 . LCCL 

7 . SHMT1 

8. CBS 

9. PNPO (PPO) 



Location 
2q31, 
9pl3 

10pll-23 

10q26 

12ql2-14 

16pter-qter 
17pll.2 
21q22 .3 



References 

Bu et al. , 1992) 

Hamosh et al. 1995) 

Bu et al. 1992) 
** 

Garrow et al . , 1993; 
Law and Kao, 1979 
* * * 

Garrow et al . 1993 * ** 
Munke et al. 198 8 
Ngo et al. 1998 



a listed with alternate names, abbreviations, and MIM numbers. 
15 Location information from GDB (*), from MIM (**). 

notes: GAD2=glutamic acid decarboxylase 2, 67 kDa. GCS=glycine cleaving system, 
P-protein^glycine decarboxylase subunit. GAD 1 =glutamic acid decarboxylase 1 , 
65 kDa. OAT=ornithine-gamma-aminotranferases. SHMT2=serine 
hydroxy methyl transferase 2, mitochondrial. LCCL=gamma-cystathionase 
20 (L-cystathionine cysteine- lyase (deaminating). SHMTl=serine 

hydroxymethyltransferase 1, soluble. CBS=cystathionine beta-synthase. PNPO= 
pyridoxamine(pyridoxine)-5 '-phosphate oxidase 
References : 

Bu et al, Proc. Nat Acad. 5c/., 89:2115 (1992). 
25 Hamosh et al, In : 'The Metabolic and Molecular Bases of Inherited Disease", 
Scriver et al (eds), New York: McGraw-Hill pp.1337-1348 (1995). 

Garrow et al J. Biol Chem. 268:1 1910-1 1916 (1993). 

Law and Kao, Cytogenet Cell Genet, 24: 102-114(1979). 

Munke et al Am J. Hum. Gen. 42:550-559 (1988). 
30 Ngo et al Biochemistry 37:7741-7748 (1998). 
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TABLE 7 

SOME LOCALIZED GENE LOCI RELATED TO PYRIDOXINE METABOLISM 



Gene/enzvme 
1. GAD 2 
5 2. GCS P-protein 
3 . GAD1 

4 . OAT 

5 . SHMT2 

10 6. LCCL 

7. SHMT1 

8 . CBS 

9. PNPO (PPO) 



Location 
2q31, 
9pl3 

10pll.23 

10q26 

12ql2-14 

16pter-qter 
17pll.2 
21q22 .3 



References 
Bu et al. 1992) 
Hamosh et al. 1995) 

Bu et al. 1992) 

* * 

Garrow et al . , 1993; 
Law and Kao, 1979 

★ * * 

Garrow et al. 1993 * ** 
Munke et al. 198 8 
Ngo et al. 1998 



a listed with alternate names, abbreviations, and MIM numbers. 
15 Location information from GDB (*), from MIM (**). 

notes: GAD2=glutamic acid decarboxylase 2, 67 kDa. GCS=glycine cleaving system, 
P-protein=glycine decarboxylase subunit. GADl=glutamic acid decarboxylase 1, 
65 kDa. OAT=ornithine-gamma-aminotranferases. SHMT2=serine 
hydroxymethyltransferase 2, mitochondrial. LCCL=gamma-cystathionase 
20 (L-cystathionine cysteine-lyase (deaminating). SHMTl=serine 

hydroxymethyltransferase 1 , soluble. CBS=cystathionine beta-synthase. PNPO= 
pyridoxamine(pyridoxine)-5 '-phosphate oxidase 
References : 

Bu et al, Proa Nat. Acad ScL, 89:21 15 (1992). 
25 Hamosh et al, In: "The Metabolic and Molecular Bases of Inherited Disease", 
Scriver et al (eds), New York: McGraw-Hill pp.1337-1348 (1995). 

Garrow et al J. Biol Chem. 268:11910-11916(1993). 

Law and Kao, Cytogenet Cell Genet, 24: 102-1 14 (1979). 

Munke et al Am J. Hum. Gen. 42:550-559 (1988). 
30 Ngo et al Biochemistry J7;774l-7748 (1998). 
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Relevance of Folate, Cobalamine, And Pyridoxine to Schizophrenia: There is 
considerable evidence that schizophrenia results, at least in part, from damage to brain 
development in utero that becomes symptomatic in late adolescence or early 
adulthood. The etiology of schizophrenia has both genetic and environmental 

5 components. Because folate, cobalamin, and pyridoxine are all ingested and 
metabolized, they could potentially be both environmental and genetic factors for 
schizophrenia. Folate, cobalamin, and pyridoxine are relevant to schizophrenia in 
important ways. First, all of them are required for cell division because of their role in 
nucleic acid synthesis [Rosenblatt, In: The Metabolic and Molecular Bases of 

1 0 Inherited Disease, Scriver et ah (eds) New York: McGraw-Hill, pp. 3 1 1 1 -3 1 28 

(1995); Benton and Rosenberg, In: The Metabolic and Molecular Bases of Inherited 
Disease, Scriver et al (eds)., New York: McGraw-Hill, 3129-3149 (1995)]. The 
developmental brain insult implicated in schizophrenia [Akbari an et al, Arch. Gen. 
Psychiatry, 50:169-177 (1993); Akbarian etal.Arch. Gen. Psychiatry, 50:178-187 

15 (1993)] is an abnormality of neurogenesis and neuronal migration, which are 

midtrimester events requiring cell division. Thus folate, cobalamin, and pyridoxine 
deficiencies could result in the widespread decreased grey matter volume observed in 
schizophrenia. 

Individuals that become schizophrenic later in life are more likely to be born during 
20 the winter and early spring [Boyd et al, Schizophr. Bull, 12:173-186 (1986); Kendell 
and Adams, Br. J. Psychiatry, 158:758-763 (1991); O'Callaghan et al, Br. J. 
Psychiatry, 158:764-769 (1991)]; this corresponds to midtrimester in late fall & 
winter. Many folate- and pyridoxine-containing foods, e.g. dark green leafy 
vegetables, are less readily available in late fall & winter in northern climates. 
25 Seasonality was found to be a major determinant of micronutrient status including 
folate status in a population of pregnant and lactating women in The Gambia where 
folate deficiency was widespread [Bates et al Eur. J. Clin. Nutr. 48:660-668 (1994)]. 
Dietary cobalamin comes from animal foods, e.g. meat, dairy products, and fish, and 
prolonged dietary insufficiency is required to produce cobalamin deficiency unless a 
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person is a strict vegetarian or already has subclinical deficiency [Sanders and Reddy, 
Am. J. Clin. Nutr., 59:1 176S-1 181S (1994)]. In fact, a significant fraction of the 
population already has subclinical deficiency for folate [Lewis et al, Ann. NY Acad. 
Scl, 678:360-362 (1993)] and for [Caimel et al, Arch. Intern. Med., 147:1995-1996 

5 (1987); Pennypacker et al, J, Am. Geriatr. Soc, 40:1 197-1204 (1992); Naurath et al, 
Lancet., 346:85-89 (1995); Allen et al, Am. J. Clin. Nutr., 62:1013-1019 (1995); 
Black et al, J. Nutr., 124:1 179-1 188 (1994)]. Also, the dietary folate requirement 
increases during pregnancy [Scholl et al, Am. J. clin. Nutr., 63:520-525 (1996); 
McPartlin et al, Lancet, 341:148-149 (1993)] and most women become folate 

10 deficient during late pregnancy [Giles, J. Clin. Pathol, 19:1-1 1 (1966)]. Cobalamin 
deficiency is also common during pregnancy [Gadowsky et al, J. Adolesc. Health, 
16:465-474 (1995)] although subnormal levels of vitamin B12 during pregnancy must 
be interpreted with caution [Metz et al, Am. J. Hemetol, 48:251-255 (1995)]. An 
increase in schizophrenia births has also been noticed after winter famine [Susser and 

15 Lin, Arch. Gen. Psychiatry, 49:983-988 (1992)]; Susser et al, Arch. Gen. Psychiatry, 
53:25-31 (1996)], a time when severe dietary deficiency of both folate and cobalamin 
is more likely. A temporary increase in the incidence of neural tube defects was 
reported in Jamaica 1 1-18 months following Hurricane Gilbert and was found to be 
associated with decreased dietary folate [Duff and Cooper, Am J. Pub.Health 84:473- 

20 476(1994)]. 

Schizophrenia is also associated with obstetrical complications, e.g. low birth weight 
and prematurity [Lewis and Murray, J. Psychiatr. Res., 21:413-421 (1987)]. Low 
birthweight and prematurity have also been associated with dietary folate deficiency 
during pregnancy Scholl et al, Am. J. clin. Nutr., 63:520-525 (1996). 
25 Hyperhomocysteinemia is a risk factor for unexplained recurrent early pregnancy loss 
[Wouters et al, Fertil Steril, 60:820-825 (1993)] and for abruptio placentae 
[Goddijn-Wesel et al, Eur. J. Obstet. Gynecol Reprod. Biol, 66:23-29 (1996)]. 
Hyperhomocysteinemia may be related to defects in folate-, cobalamin-, or 
pyridoxine-dependent reactions [Naurath et al, Lancet., 346:85-89 (1995)]. 
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16:465-474 (1995)] although subnormal levels of vitamin B12 during pregnancy must 
be interpreted with caution [Metz et al, Am. J. Hemetol, 48:251-255 (1995)]. An 
increase in schizophrenia births has also been noticed after winter famine [Susser and 

15 Lin, Arch. Gen. Psychiatry, 49:983-988 (1992)]; Susser et al, Arch. Gen. Psychiatry, 
53:25-31 (1996)], a time when severe dietary deficiency of both folate and cobalamin 
is more likely. A temporary increase in the incidence of neural tube defects was 
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20 476(1994)]. 
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Interestingly, stillbirths and schizophrenia share a similar seasonality of birth excess 
[Torrey et al, Schizophr. Bull, 19:557-562 (1993)]. Also N 2 0, an anaesthetic gas 
that inhibits MTR, a cobalamin-requiring enzyme of folate metabolism, is a 
reproductive toxin for both men and women [Louis-Ferdinand, Adverse Drug React, 
5 Toxicol Rev., 13:1 93-206 ( 1 994)]. Methotrexate, an inhibitor of dihydrofolate 
reductase (DHFR), induces abortion. 

Dietary folate deficiency and low plasma folate are common in inner city urban 
populations [Scholl et al, Am. J. din. Nutr., 63:520-525 (1996)]. Likewise, 
schizophrenia has been reported to be more common in inner city urban populations 
10 [Fuller and Bowler, ScA/zop/ir. Bull., 16:591-604(1990)]. Also, both low folate 

intake [Schorah and Wild, Lancet., 341:1417 (1993)] and schizophrenia [Dohrenwned 
et al, Science, 255:946-952 (1992)] are correlated with lower socioeconomic status. 

Immune function is impaired in folate deficiency [LeLeiko and Chao, In: Rudolph 's 
Pediatrics, 20th ed., Stamford, CT: Appleton & Lange, pp. 1001-1010 (1996)], in 

15 cobalamin deficiency [Hitzig et al., Ciba. Found. Symp., 68:77-91 (1978)] and in 
pyridoxine deficiency [Trakatellis et al. Postgrad Med. J. 73:617-622 (1997)] and 
deficient individuals are more susceptible to infection; Methotrexate, an inhibitor of 
dihydrofolate reductase, inhibits immune function [Hughes, In: Rudolph s Pediatrics, 
20th ed., Stamford, CT: Appletone and Lange, pp. 517-519 (1997)]. And, as 

20 mentioned, dietary folate and cobalamin requirements increase during pregnancy 
[Scholl et al, Am. J. clin. Nutr., 63:520-525 (1996); McPartlin et al, Lancet, 
341:148-149 (1993)]. This is relevant because the season-of-birth effect just 
mentioned in connection with dietary folate, or cobalamin deficiency has also been 
explained by in utero infectious illness, the "viral theory" of schizophrenia. 

25 Individuals born following winters with severe influenza epidemics are more likely to 
develop schizophrenia [Adams et al, Br. J. Psychiatry, 163:522-534 (1993)] though 
not all studies find this effect. Although it has not been demonstrated that either the 
schizophrenia fetus or the pregnant mother actually developed influenza, the 
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histologic pattern in schizophrenia of a neuronal migration abnormality during brain 
development has been seen as compatible with a fetal viral infection [Kovelman and 
Scheibel, Biol Psychiatry, 19:1601-1621 (1984); Bogerts etal, Arch. Gen. 
Psychiatry, 42:784-791 (1985); Akbarian et al., Arch. Gen. Psychiatry, 50:169-177 

5 (1993); Akbarian etal, Arch. Gen. Psychiatry, 50:178-187 (1993)]. Thus folate or 
cobalamin, deficiency during pregnancy could result in greater susceptibility to viral 
infection affecting mother, fetus, or both. The infectious agent could be influenza 
itself. Alternatively, a severe influenza epidemic could be a "marker" of a severe 
winter, and infection by another agent could cause the brain damage. In this way, 

1 0 folate or cobalamin deficiency could cause the season-of-birth effect either through 
the mechanism of dietary deficiency alone, through maternal immune deficiency and 
infection, or both. 

Methotrexate, a DHFR inhibitor, is also an important therapeutic agent for 
rheumatoid arthritis. Rheumatoid arthritis has repeatedly been found to have a 
1 5 decreased frequency in schizophrenics, a puzzling finding that remains unexplained 
[Eaton et al, Schizophr. Res., 6:181-192 (1992)]. 

The developmental model of schizophrenia postulates that brain damage sustained in 
the second trimester of fetal life results in schizophrenia later in development [Brixey 
et al, J. Clin. Psychol, 49:447-456 (1993)]. Both folate and cobalamin are already 

20 known to contribute to a first trimester fetal nervous system malformation, spina 
bifida cystica [Kirke et al, Q. J. Med., 86:703-708 (1993); Gordon, Brain Dev., 
17:307-31 1 (1995)], and possibly other birth defects [Shaw et al, Lancet., 346:393- 
396 (1995); Czeizel, Lancet. , 345:932 (1995)]. Some studies [Whitehead et al, Q. 1 
Med., 88:763-766 (1995); van der Put et al, Lancet., 346:1070-1071 (1995); Ou et 

25 al, Am. J. Med. Genet., 63:610-614 (1996); Chatkupt et al, Am. Acad. Neurol Works 
in Progres, WIP4: (1996)] suggest that a genetic susceptibility factor for spina bifida 
is a common allele of the folate gene, MTHFR, the nucleotide 677C->T transition 
converting an alanine residue to valine resulting in a heat-labile enzyme protein. 
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Homozygotes for this allele, about 10% of the normal population, have lower 
erythrocyte folate and plasma folate during pregnancy [Molloy et al, Lancet,, 
349:1591-1593 (1997)]. Homozygotes for this allele also develop moderately 
elevated blood homocysteine [van der Put et al, Lancet, 346:1070-1071 (1995); 

5 Frosst et al, Nature Genet., 10:1 1 1-1 13 (1995)] in the presence of dietary folate 
deficiency. Moderate hyperhomocysteinemia is toxic to adults [Fermo et al., Ann. 
Intern. Med., 123:747-753 (1995)], and toxic to the fetus in early gestation [Wouters 
et al, Fertil. Steril, 60:820-825 (1993)], and possibly teratogenic in the first trimester 
causing neural tube defects [Whitehead et al, Q. J. Med., 88:763-766 (1995); van der 

10 Put et al, Lancet., 346:1070-1071 (1995); Ou et al, Am. J. Med Genet., 63:610-614 
(1996). Thus, the MTHFR heat-labile mutation, in the presence of decreased dietary 
folate in midtrimester, could be teratogenic both through hyperhomocysteinemia and 
also through folate deficiency causing the developmental brain damage hypothesized 
in the developmental model of schizophrenia [Brixey et al, J. Clin. Psychol, 49:447- 

15 456 (1 993)]. A second common polymorphism of MTHFR, the ntl298 A->C 
mutation could also be a genetic risk factor for spina bifida [van der Put et al, 
Lancet, 346:1070-1071 (1995]. 

Schizophrenia is a common disorder, affecting 1% or more of the population [Karno 
et al, In: Comprehensive Textbook of Psychiatry /VI, 6th ed., Baltimore: Williams & 

20 Wilkins, pp. 902-910 (1995)]. Thus, if a significant proportion of schizophrenia 
shares a common etiology, both the genetic susceptibility factors and the 
environmental factors must be common in the population. As mentioned earlier, a 
significant fraction of the population is already sub-clinically deficient for folate and 
for cobalamin; also, pregnancy may increase this fraction since dietary folate and 

25 cobalamin requirements increase during that time. Several functional polymorphic 
alleles of folate and cobalamin genes are also common in the population including the 
MTHFR mutations just mentioned and polymorphisms of thymidylate synthase 
[Horie et al, Cell Struct Funct, 20:191-197 (1995)], transcobalamin H [Li et al, 
Biochim. Biophys. Acta., 1219:515-520 (1994)], and folate-binding proteins [Li et a!., 
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1994, supra; Shen et al, Biochem., 33:1209-1215 (1994)]. Metabolic indicators of 
folate or cobalamin deficiency, e.g. hyperhomocysteinemia and 
hypermethylmalonicacidemia, are also common in the population [Naurath et al, 
Lancet., 346:85-89 (1 995)]. Thus there exists a statistical basis for the hypothesis 
5 that schizophrenia is a birth defect resulting from the action during gestation of 
genetic risk factors and environmental factors related to folate and/or cobalamin that 
lead to the generation of risk factors. Such factors are sufficiently common that at 
least in principle all cases of schizophrenia could result from this mechanism. 

Finally, folate, cobalamin, and pyridoxine are relevant for schizophrenia because of 
10 findings in patients. Severe genetic deficiency of MTHFR may cause a 

"schizophrenia" phenotype [Freeman et al., N Engl. J. Med., 292:491-496 (1975); 
Regland et al., J. Neural Transm. Gen. Sect, 98:143-152 (1994)]. Genetic deficiency 
of other folate and cobalamin enzymes has been reported to cause nervous system 
disease, psychiatric disease, or schizophrenia-like illness [Mudd et al, In: The 
15 Metabolic and Molecular Bases of Inherited Disease, Scriver et al (eds), New York: 
McGraw-Hill pp. 1279-1327 (1995); Hitzigef al, Ciba. Found. Syrnp., 68:77-91 
(1978); Cooper and Rosenblatt, Annu. Rev. Nutr., 7:291-320 (1987); Shevall and 
Rosenblatt, Can. J. Neurol Sci., 19:472-486 (1992); Hall, Br. J. Haematol, 80:117- 
1 20 (1 992)]. Likewise, dietary deficiencies of folate or cobalamin may have similar 
20 effects [Cooper and Rosenblatt, Annu. Rev. Nutr., 7:291-320 (1987); Shevall and 

Rosenblatt, Can. J. Neurol Scl, 19:472-486 (1992)]. Methylfolate therapy reportedly 
improved the clinical status of schizophrenics with borderline or definite folate 
deficiency [Godfrey et al, Lancet., 2:392-395 (1990); Procter, Br. J. Psychiatry, 
159:271-272 (1991)] although the improvement claimed was small and the finding 
25 controversial. Folate deficiency has been associated with disturbances in mood 
> [Shulman, In: Folic Acid in Neurology, Psychiatry, and Internal Medicine, New 
York: Raven Pr., 463-474 (1979)], and it has been suggested that the most common 
neuropsychiatric system abnormality in severe folate deficiency is depression 
[Reynolds et al, Lancet., ii:196-198 (1984)]. Methyltetrahydrofolate reportedly 
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improved symptoms of depression in an open trial in elderly depressed patients 
[Guaraldi et al Ann.Clin.Psychiatry 5:101-105 (1993)]. Schizophrenics are reported 
to have an 80% excess mortality from cardiovascular disease [Gottesman, 
Schizophrenia Genesis, Schizophrenia Genesis- The Origins of Madness, W.H. 

5 Freeman & Co. N.Y.(1991)]; hyperhomocysteinemia, dietary folate deficiency and 
the MTHFR 677C->T mutation have been implicated in cardiovascular disease in 
some studies [Morita et al, Circulation, 95:2032-2036 (1997)] but not others 
(Anderson et al, J. Am. Coll Cardiol 30:1206-121 1 (1997)]. Also, kynureninase, an 
important enzyme of tryptophan metabolism, affecting niacin metabolism and 

10 serotonin synthesis, is pyridoxine-dependent. Niacin deficiency (pellagra) can cause 
mental changes including psychosis and hallucinations [Wilson, Vitamin deficiency 
and excess, pp.472-480. In: Harrison s Principles of Internal Medicine, (Scriber et al 
e's.) McGraw-Hill, Inc., N.Y. (1994)]. Also, clozapine, resperidone, and olanzapine 
are thought to exert their antipsychotic effect in schizophrenia in part through 

1 5 serotonin receptor antagonism. 

Gene Localization Studies in Schizophrenia and Folate/Cobalamine/Pyridoxine 
Genes: If folate, cobalamin, or pyridoxine genes are susceptibility factors for 
. schizophrenia, it is possible that gene localization studies have already identified 
candidate chromosome regions that contain such a gene (Tables 3, 5, and 7). For three 
20 folate or cobalamin genes, DHFR, TCNII and TYMS, there is excellent concordance 
with schizophrenia gene localization studies. 

On chromosome 5, DHFR has been located at 5ql 1 .2-13.2. A schizophrenia 
translocation [t(l;5)(lq32.3;5ql 1.2-13.3)] was reported [McGillivray et al, Am. J. 
Med Genet, 35:10-13 (1990); Bassett, Br. 1 Psychiatry, 161:323-334 (1992)] 
25 affecting 5ql 1 .2-5ql 3.3. A proband and uncle, both with schizophrenia and 

eye-tracking abnormalities, had partial trisomy for 5ql 1.2-5ql3.3; the third copy was 

inserted at lq32.3 giving a derivative chromosome, der(l)inv 

ins(l ;5)(q32.2;ql3.3ql 1.2). The proband's mother had a balanced translocation but 
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was phenotypically normal without schizophrenia or eye-tracking abnormalities. She 
had the derivative chromosome 1 with extra material from chromosome 5 inserted but 
a corresponding deletion in one of her chromosomes 5. She thus had only two copies 
of 5ql 1.2-5ql3.3. Further studies [Gilliam et al, Genomics, 5:940-944 (1989)] 
5 showed that the DHFR gene is located within this deleted region, 5ql 1 .2-13.3. 
Another schizophrenia chromosome abnormality, inv5(pl3;ql3), has been reported 
[Bassett, Br. J. Psychiatry, 161:323-334 (1992)] affecting 5ql3. 

On chromosome 5, two-point lod scores of 4.64 and 2.29 were found [Sherrington et 
aL, Nature, 336:164-167 (1988)] for the polymorphic markers D5S76 and D5S39 

1 0 respectively in the region of the chromosome abnormality just discussed [McGillivray 
et aL, Am. J. Med. Genet., 35:10-13 (1990); Bassett, Br. J. Psychiatry, 161:323-334 
(1992)] affecting 5ql 1.2-13.3. Two other linkage studies found small positive lod 
scores in this region [Coon et aL, Biol. Psychiatry, 34:277-289 (1993); Kendler and 
Diehl, Schizophr. Bull., 19:261-285 (1993)], but numerous other studies excluded this 

1 5 region.under the assumptions and models used [Kendler and Diehl, Schizophr. Bull., 
19:261-285 (1993)]. 

On chromosome 18, TYMS has been located at 18pl 1.32-pl 1.22. A ring 
chromosome with deletion of 18pter-pl l,18q23-qter [Bassett, Br. J. Psychiatry, 
161 :323-334 (1992)] was reported in a kindred with schizophrenia and bipolar illness 
20 [Bassett, Br. J. Psychiatry, 161:323-334 (1992)]. Deletion of a segment of 18p was 
reported in a schizophrenia chromosome [Bassett, Br. J. Psychiatry, 161:323-334 
(1992)]. 

On chromosome 22, TCNII has been located at 22ql 1 .2-ql3, possibly at the 22ql2/13 
border. High lod scores have consistently been obtained in the region of TCNII: 
25 IL2RB, in 22ql2-ql3.1 gave a lod score [Pulver et aL, Am. J. Med. Genet., 54:3-43 
(1994)] of 2.82. Other markers over a broad region of 22q have given suggestive lod 
scores. D22S278, in 22ql2, gave a lod score [Vallada et aL, Am. J. Med. Genet., 



WO 00/71754 



PCT/US00/14354 



70 * 

was phenotypically normal without schizophrenia or eye-tracking abnormalities. She 
had the derivative chromosome 1 with extra material from chromosome 5 inserted but 
a corresponding deletion in one of her chromosomes 5. She thus had only two copies 
of 5qll.2-5ql3.3. Further studies [Gilliam et al., Genomics, 5:940-944 (1989)] 
5 showed that the DHFR gene is located within this deleted region, 5ql 1 .2-13.3. 
Another schizophrenia chromosome abnormality, inv5(pl3;ql3), has been reported 
[Bassett, Br. J. Psychiatry, 1 61 :323-334 (1 992)] affecting 5ql 3. 

On chromosome 5, two-point lod scores of 4.64 and 2.29 were found [Sherrington et 
al, Nature, 336:164-167 (1988)] for the polymorphic markers D5S76 and D5S39 

1 0 respectively in the region of the chromosome abnormality just discussed [McGillivray 
et al, Am. J. Med. Genet, 35:10-13 (1990); Bassett, Br. J. Psychiatry, 161:323^334 
(1992)] affecting 5qll.2-13.3. Two other linkage studies found small positive lod 
scores in this region [Coon et al, Biol Psychiatry, 34:277-289 (1993); Kendler and 
Diehl, Schizophr. Bull, 19:261-285 (1993)], but numerous other studies excluded this 

1 5 region under the assumptions and models used [Kendler and Diehl, Schizophr. Bull, 
19:261-285(1993)]. 

On chromosome 18, TYMS has been located at 18pl 1.32-pl 1.22. A ring 
chromosome with deletion of 1 8pter-pl 1 ,1 8q23-qter [Bassett, Br. J. Psychiatry, 
161 :323-334 (1992)] was reported in a kindred with schizophrenia and bipolar illness 
20 [Bassett, Br. J. Psychiatry, 161:323-334 (1992)]. Deletion of a segment of 18p was 
reported in a schizophrenia chromosome [Bassett, Br. J. Psychiatry, 161 :323-334 
(1992)]. 

On chromosome 22, TCNII has been located at 22ql 1.2-ql3, possibly at the 22ql2/13 
border. High lod scores have consistently been obtained in the region of TCNII: 
25 IL2RB, in 22ql2-ql3.1 gave a lod score [Pulver et al, Am. J. Med. Genet, 54:3-43 
(1994)] of 2.82. Other markers over a broad region of 22q have given suggestive lod 
scores. D22S278, in 22ql2, gave a lod score [Vallada et al, Am. J. Med. Genet, 



WO 00/71754 



PCT/USOO/14354 



71 

60:139-146 (1995)] of 1.51. CRYB2, in 22ql 1.2-ql2.1, gave a lod score [Lasseter et 
al,Am.JMed. Genet., 60:172-173 (1995)] of 1.71. D22S10, in 22ql 1.1-ql 1 .2, gave 
a lod score [Coon et al, Biol. Psychiatry, 34:277-289 (1993)] of 0.79. Highly 
significant p-values for non-parametric analyses have also been obtained: D22S278, 
5 in 22ql2, for example gave p=.001 [Gill et al, Am. J Med. Genet, 67:40-45 (1996)]. 

The deletions of velocardiofacial (VCF) syndrome and related disorders (DiGeorge 
syndrome (DGS) and CATCH22) are located [Lindsay et al, Genomics, 32:104-1 12 
(1996)] at 22qll.2. A psychotic disorder develops in about 10% of patients with 
VCF syndrome [Chow et al, Am. J Med. Genet., 54:107-1 12 (1994)]. TCNII is not 

10 known to be located at or within these deletions. VCF and related disorders are 
relatively uncommon compared to schizophrenia; only 2 of 100 randomly selected 
patients (92 schizophrenics, 5 with schizoaffective disorder, and 3 with 
schizophreniform disorder) in the Maryland Epidemiological Sample were found 
[Lindsay et al, Am. J Hum. Genet., 56:1502-1503 (1995)] to have VCF-related 

1 5 deletions (and later VCF syndrome) on 22q 1 1 .2. Consequently, it is not clear 
whether schizophrenia linkage studies are detecting a haplotype related to a VCS 
locus or some other locus in this region, such as TCNII. 

For some other folate, cobalamin, or pyridoxine relevant genes, physical or genetic 
studies of schizophrenia have identified chromosomal regions near the gene. 

20 DISCUSSION 

The folate-cobalamin hypothesis for schizophrenia is attractive because it suggests 
that a single mechanism of genetic and environmental factors may play a major role 
in the etiology and pathogenesis of schizophrenia. The combined result of this 
mechanism is to damage fetal development, especially brain development by 

25 inhibiting nucleic acid synthesis, by affecting gene methylations, by increasing 
susceptibility to infection, and/or by producing teratogens. 
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This mechanism addresses several puzzling features of schizophrenia such as the 
season of birth effect, the association with famine and influenza epidemics, the 
negative association with rheumatoid arthritis, the associations with obstetrical 
abnormalities, social class, and urban environment. The mechanism also suggests 
5 approaches to diagnostic testing, to prevention, and to improved therapy. 

It is not excluded that such a mechanism could also apply to a number of common 
human developmental disorders that have been shown to have a genetic component to 
their etiology but whose mode of inheritance has been difficult to determine and for 
which linkage studies have met with unexpected difficulties or have achieved limited 

10 success. These developmental disorders include Tourette's syndrome & related 
disorders (e.g. obsessive-compulsive disorder and chronic multiple tics syndrome) 
[Pauls, Adv Neurol 58:151-157 (1992); McMahon et al, Adv Neurol 58:159-165 
(1992); Heutink et al, Am J Hum Genet, 57:465-473 (1995); Grice et al, Am J Hum 
Genet, 59:644-652 (1996)], learning disorders, including dyslexia [Lewis, et al, 

15 BehavGenet, 23:291-297 (1993); Pennington, J Child Neurol 10 Suppl, 1:S69-S77 
(1995)], conduct disorder [Lombroso et al, J. Am. Acad. Child Adolesc. Psychiatry, 
33:921-938 (1994)], attention-deficit hyperactivity disorder [Lombroso et al, 1994, J. 
Am. Acad. Child Adolesc. Psychiatry, 33:921-938 (1994)], bipolar illness [Baron, 
Acta. Psychiatr. Scand., 92:81-86 (1995); Benjamin and Gershon, Biol Psychiatry, 

20 40:313-316 (1996); Risch and Botstein, Nature Genet., 12:351-353 (1996); Jamison 
and Mclnnis, Nature Med., 2:521-522 (1996); Morell, Science, 272:31-32 (1996)], 
autism [Lombroso et al, 1994, J. Am. Acad. Child Adolesc. Psychiatry, 33:921-938 
(1994)], and obsessive-compulsive disorder in adults [Lombroso et al, 1994, J. Am. 
Acad. Child Adolesc. Psychiatry, 33:921-938 (1994)]. Some of these disorders have 

25 been shown to be associated with schizophrenia. 

The present invention may be better understood by reference to the following non- 
limiting Examples, which are provided as exemplary of the invention. The following 
Examples are presented in order to more fully illustrate one embodiment of the 
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invention. They should in no way be construed, however, as limiting the broad scope 
of the invention. 



EXAMPLE 1 
DIAGNOSING SCHIZOPHRENIA 

5 Structure of Datafiles 

Data are arranged in a file suitable for input into a binary logistic regression program 
(Table 8). A model is created consisting of those explanatory variables actually 
available from the specific patient-to-be-diagnosed and family members participating 
in the testing. This new combined data set (reference data set + data from 

1 0 patient-to-be-diagnosed with participating family members) is analyzed by binary 
logistic regression for the model chosen giving the predicted probability that a 
proband is affected with schizophrenia for all of the probands including the 
patient-to-be-diagnosed. 

The model can be modified if required. The goodness of fit for the 
1 5 patient-to-be-diagnosed is checked. The predicted probability that the 

patient-to-be-diagnosed has schizophrenia is compared with a classification table 
generated from the model used to determine likelihood of false positives and false 
negatives. The predicted probability that the patient-to-be-diagnosed is affected with 
schizophrenia, with likelihood of false positive or false negative result, is returned to 
20 the clinician. 
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TABLE 8 

A HYPOTHETICAL PARTIAL REFERENCE DATA SET OF GENETIC 
EXPLANATORY VARIABLES TO ILLUSTRATE DATA STRUCTURE 
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11 ... 

n 

For each proband (Table 8), the record contains several variables: 
identification number (ID) of the proband. 

a binary response variable (resp) for affection status of the proband: response=l , if the 
proband is affected with schizophrenia; response=0 if proband is unaffected {i.e. a control individual). 
The proband is not necessarily one of the individuals for whom genotype data (explanatory variables) 
are available. The patient-to-be-diagnosed is assigned response=0 when added to the reference data 
set. 

a set of explanatory variables: i.e. sets of genotypes of mutations found in the schizophrenia 
patients and family members and controls and family members. The schizophrenia patients and the 
control individuals are probands (P) as is the patient-to-be-diagnosed. Unaffected family members are 
the proband's mother (M), father (F), sib(s) (S 1 , S2, etc.), child(ren) (C 1 , C2, etc.) or other relatives. 
Data for affected family members, e.g. the proband's mother (MA), father (FA), sibs (SA1, SA2, etc.), 
children (CA1, CA2, etc.), or other relatives, are entered as separate explanatory variables. 
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TABLE 8 

A HYPOTHETICAL PARTIAL REFERENCE DATA SET OF GENETIC 
EXPLANATORY VARIABLES TO ILLUSTRATE DATA STRUCTURE 

ID resp Pill PI 12 P211 P212 Mill Ml 12 M311 F511 S2-411 CA1-111 

I I 1 0 1 1 1 1 0 0 1 1 
21 10000001 0 0 
311110100111 
41 000 0001 00 0 
51 00 1 11 1000 1 
60 1 0 0 0 0 0 0 0 0 0 
700000001000 
8000 1 00001 1 0 

9 0 1 0 0 0 1 0 0 0 11 

10 0000 0 1 000 1 0 

II ... 



n 



For each proband (Table 8), the record contains several variables: 
identification number (ID) of the proband. 

a binary response variable (resp) for affection status of the proband; response=l, if the 
proband is affected with schizophrenia; response=0 if proband is unaffected {i.e. a control individual). 
The proband is not necessarily one of the individuals for whom genotype data (explanatory variables) 
are available. The patient-to-be-diagnosed is assigned response=0 when added to the reference data 
set. 

a set of explanatory variables: i.e. sets of genotypes of mutations found in the schizophrenia 
patients and family members and controls and family members. The schizophrenia patients and the 
control individuals are probands (P) as is the patient-to-be-diagnosed. Unaffected family members are 
the proband's mother (M), father (F), sib(s) (SI, S2, etc.), child(ren) (CI, C2, etc.) or other relatives. 
Data for affected family members, e.g. the proband's mother (MA), father (FA), sibs (SA1, SA2, etc.), 
children (CA1 , CA2, etc.), or other relatives, are entered as separate explanatory variables. 
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Genetic explanatory variables: Each individual has 0, 1, or 2 copies of any given 
mutation allele at a given locus. Thus a genotype at each locus contributes two 
independent explanatory variables. Most of the affected family members will be 
relatives of schizophrenia probands, but occasionally a relative of an unaffected 
5 proband will turn out to be affected with schizophrenia. 

Mutations are tabulated as explanatory variables: (see Table 8): 

(i) by the proband or relative in whom they occur, (e.g. P, M, F, S2, CI , MA, FA, 
SA1,C Another); 

(ii) by the specific folate, cobalamin, or pyridoxine gene locus in which they 

10 occur (e.g. 1=DHFR locus, 2=MTHFR locus, 3-TCN2 locus, 4=MTR locus, 5=CBS 
locus, etc.); 

(iii) by the specific mutation within a locus (e.g., l=the first-designated mutation 
within a locus, 2=the second-designated mutation within a locus, etc.); and 

(iv) by whether the individual has a single or double dose of the mutation. Thus 
15 an explanatory variable P321 records whether the proband has a single dose of the 

second-designated mutation of the third-designated locus, i.e. TCN2. A variable 
M312 records whether the proband's mother has a double dose of the first-designated 
TCN2 mutation studied. 

In the present hypothetical reference dataset illustrated of genetic explanatory 
20 variables (Table 8), partial genotype data for probands, mothers, fathers, sibs and 
children are given for five gene loci. Not all of the possible explanatory variables are 
shown. Probands 1-5 are unrelated individuals with the definite clinical diagnosis of 
schizophrenia; probands 6-10 are unrelated unaffected (control) individuals. 
Probands 1, 2, 3, 6 and 9 all have a single copy of the first-designated DHFR 
25 mutation; proband 3 also has a second copy of that mutation. Probands 1 , 3, 5 and 8 
all have a single copy of the first-designated mutation at the MTHFR locus; probands 
1 and 5 also have a second copy of that mutation. Mothers of probands 1, 3, 5, 9 and 
10 all have a single copy of the first-designated DHFR mutation; mothers of probands 
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1 and 5 also have a second copy of this mutation. Mothers of probands 4 and 7 each - 
have a single copy of the first-designated mutation of TCN2; data for a double dose 
are not shown. The fathers of probands 2, 3, and 8 each have a single copy of the first 
designated mutation of CBS; data for a double dose are not shown. The second 

5 (unaffected) sibs of probands 1, 3, 8, 9, and 10 each have a single copy of the 
first-designated mutation of MTR; data for a double dose are not shown. The first 
affected children of probands 1, 3, 5, and 9 each have a single copy of the 
first-designated mutation of DHFR. Other susceptibility loci and mutations can be 
incorporated in Table 8 in the same fashion e.g., cytokine gene mutations or 

10 polymorphisms, or major histocompatibility complex (MHC) mutations or 
polymorphisms. 

Environmental explanatory variables: If only genetic explanatory variables 
(genotype data) are used, the maximum predicted probability that the proband is 
affected with schizophrenia is expected to be approximately about 0.5 in most 
1 5 populations. When environmental risk factors are included as explanatory variables, 
the maximum predicted probability that the proband is affected with schizophrenia 
may approach 1.0. Examples of environmental risk factors for a schizophrenia patient 
include: 

( 1 ) the proband's dietary folate/cobalamin/pyridoxine intake. 
20 (2) the proband's circulating levels of folate/cobalamin/pyridoxine. 

(3) the proband's circulating levels of homocysteine, methylmalonic acid, or 
cystathionine. Elevated levels are indicators of subtle folate/cobalamin deficiency. 

(4) the proband's mother's dietary folate/cobalamin/pyridoxine intake at the time 
of patient diagnosis, during a pregnancy, or during the pregnancy that produced the 

25 proband. 

(5) the proband's mother's circulating levels of homocysteine, methylmalonic 
acid, or cystathionine at the time of patient diagnosis, during a pregnancy, or during 
the pregnancy that produced the proband. 
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(6) dietary or circulating folate/cobalamin/pyridoxine or circulating levels of 
homocysteine, methylmalonic acid, or cystathionine for other family members. 

(7) epidemiological factors related to the probanda gestation and birth, e.g. low 
birth weight or preterm birth, maternal infection, maternal smoking (associated with 

5 low plasma folate), season of birth (late winter or spring births are more common in 
schizophrenia), etc. 

Method of Data Analysis 
The method exemplified herein is based upon the published guide for the SAS 
system, but other software can be used. The dataset is analyzed using binary logistic 
1 0 regression to model the response probability, p i? that the ith proband's affection status 
is 1, i.e. the probability that the ith proband has schizophrenia, given the vector of 
explanatory variables, Xj. That is: 

p i = Prob(y i =l|x i ). 

To do this the logit transformation of p 1 is modeled as a linear function of the 
1 5 explanatory variables in the vector, x f : 

logit (p^ = log (p/tl-pj) = alpha + beta'x ; 
where: alpha is the intercept parameter and 

beta is the vector of slope parameters. 
In SAS, the "descending" option is used to model the probability that the response=l, 
20 as in the present analysis, rather than response=0. 

Outputs of binary logistic regression analysis 
After analysis of a dataset, the outputs obtained from SAS include: 

(a) Estimates and standard errors of the parameters (alpha and beta). 
Using estimates of the intercept parameter (alpha) and the slope parameter (beta) for 

25 each environmental or genetic risk factor, the logistic regression equation for the 
dataset can be written. 

(b) Significance tests of the parameters (e.g. Wald chi-square). From the 
corresponding p- values, the level of significance of each of the environmental or 
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(6) dietary or circulating folate/cobalamin/pyridoxine or circulating levels of 
homocysteine, methylmalonic acid, or cystathionine for other family members. 

(7) epidemiological factors related to the proband's gestation and birth, e.g. low 
birth weight or preterm birth, maternal infection, maternal smoking (associated with 

5 low plasma folate), season of birth (late winter or spring births are more common in 
schizophrenia), etc. 

Method of Data Analysis 
The method exemplified herein is based upon the published guide for the SAS 
system, but other software can be used. The dataset is analyzed using binary logistic 
10 regression to model the response probability, p i5 that the ith proband's affection status 
is 1, i.e. the probability that the ith proband has schizophrenia, given the vector of 
explanatory variables, x it That is: 

p i = Prob(y i =l|x i ). 

To do this the logit transformation of p' is modeled as a linear function of the 
1 5 explanatory variables in the vector, Xji 

logit (pO = log (p/[l-Pi]) = alpha + beta'X; 
where: alpha is the intercept parameter and 

beta is the vector of slope parameters. 
In SAS, the "descending" option is used to model the probability that the response=l, 
20 as in the present analysis, rather than response=0. 

Outputs of binary logistic regression analysis 
After analysis of a dataset, the outputs obtained from SAS include: 

(a) Estimates and standard errors of the parameters (alpha and beta). 
Using estimates of the intercept parameter (alpha) and the slope parameter (beta) for 

25 each environmental or genetic risk factor, the logistic regression equation for the 
dataset can be written. 

(b) Significance tests of the parameters (e.g. Wald chi-square). From the 
corresponding p-values, the level of significance of each of the environmental or 
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genetic risk factors is determined. A global significance test of the data with 
corresponding p-value is also determined. 

(c) Odds ratios are given for the slope parameters of each environmental 
or genetic risk factor. Thus the amount contributed by each environmental or genetic 
risk factor to the risk of schizophrenia is determined. 

(d) The confidence limits for regression parameters and odds ratios are 
determined. 

(e) The predicted probabilities of the observations can be computed, i.e. 
the probability that each individual in the dataset has schizophrenia: 

alpha- = estimate of the intercept parameter; 
beta- = vector of the estimates of the slope parameters; 
x = vector of the explanatory variables; 
p~ = predicted probabilities 
1 

(f) The model is modified by adding or removing variables until a model 
is found that best fits the data; 

(g) The model is tested for goodness-of-fit. Also, the degree of influence 
of each specific observation is tested to detect extreme or ill-fitting observations. 
These may be examples of data entry errors or alternatively, observations that do not 
fit the present model for schizophrenia. 

(h) The probability that a new individual (the patient-to-be-diagnosed) is 
schizophrenic is then calculated from the final, modified, best fitting regression 
equation based upon parameters derived from a corrected/modified data set. A simple 
method of doing this is to add the data for the patient-to-be-diagnosed to the reference 
data set, a large group of well-studied schizophrenia probands, schizophrenia family 
members, control probands and control family members for whom data are available 
for many explanatory variables. A model is created consisting of those informative 
explanatory variables actually available from the specific patient-to-be-diagnosed and 
family members participating in the testing. This new combined data set (reference 
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genetic risk factors is determined. A global significance test of the data with 
corresponding p-value is also determined. 

(c) Odds ratios are given for the slope parameters of each environmental 
or genetic risk factor. Thus the amount contributed by each environmental or genetic 
risk factor to the risk of schizophrenia is determined. 

(d) The confidence limits for regression parameters and odds ratios are 
determined. 

(e) The predicted probabilities of the observations can be computed, /'.e. 
the probability that each individual in the dataset has schizophrenia: 

alpha- = estimate of the intercept parameter; 
beta- = vector of the estimates of the slope parameters; 
x = vector of the explanatory variables; 
p~ = predicted probabilities 
1 

p~ r+~exp(alpha~ - beta-'x) 

(f) The model is modified by adding or removing variables until a model 
is found that best fits the data; 

(g) The model is tested for goodness-of-fit. Also, the degree of influence 
of each specific observation is tested to detect extreme or ill-fitting observations. 
These may be examples of data entry errors or alternatively, observations that do not 
fit the present model for schizophrenia. 

(h) The probability that a new individual (the patient-to-be-diagnosed) is 
schizophrenic is then calculated from the final, modified, best fitting regression 
equation based upon parameters derived from a corrected/modified data set. A simple 
method of doing this is to add the data for the patient-to-be-diagnosed to the reference 
data set, a large group of well-studied schizophrenia probands, schizophrenia family 
members, control probands and control family members for whom data are available 
for many explanatory variables. A model is created consisting of those informative 
explanatory variables actually available from the specific patient- to-be-diagnosed and 
family members participating in the testing. This new combined data set (reference 
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data set + data from patient-to-be-diagnosed with participating family members) is 
analyzed by binary logistic regression for the model chosen giving the predicted 
probability that a proband is affected with schizophrenia for all of the probands 
including the patient-to-be-diagnosed. 

5 (i) A classification table is produced from the data set by the "jack 

knifing" procedure or an approximation to it. This procedure classifies each 
observation as an event or nonevent based on the model that omits the observation 
being classified. A classification table sorts observations into percent correct, percent 
false positives, and percent false negatives at various probability levels and computes 

10 sensitivity and specificity. 

(j) The data set used for diagnostic testing is constantly being updated 
and the regression equation corrected. For example, stratification by geographic 
residence or geographic origin of ancestors must be considered for some 
environmental or genetic risk factor. 

1 5 For example, in Table 9, entries 34-43 are shown for the data file containing 

genotypes of 38 schizophrenic probands plus 21 1 control probands; the first 38 are 
the affected probands. For individual 302088, the proband is affected ("1 "); there is a 
single dose ("1") of the DHFR mutation but not a double dose ("0") and a single dose 
("1") of the MTHFR mutation but not a double dose ("0"). The number 302088 

20 identifies the individual whose genotypes are listed; the proband, in this case, is the 
same individual. 
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data set + data from patient-to-be-diagnosed with participating family members) is 
analyzed by binary logistic regression for the model chosen giving the predicted 
probability that a proband is affected with schizophrenia for all of the probands 
including the patient-to-be-diagnosed. 

(i) A classification table is produced from the data set by the "jack 
knifing" procedure or an approximation to it. This procedure classifies each 
observation as an event or nonevent based on the model that omits the observation : 
being classified. A classification table sorts observations into percent correct, percent 
false positives, and percent false negatives at various probability levels and computes 
sensitivity and specificity. 

(j) The data set used for diagnostic testing is constantly being updated 
and the regression equation corrected. For example, stratification by geographic 
residence or geographic origin of ancestors must be considered for some 
environmental or genetic risk factor. 

For example, in Table 9, entries 34-43 are shown for the data file containing 
genotypes of 38 schizophrenic probands plus 21 1 control probands; the first 38 are 
the affected probands. For individual 302088, the proband is affected ("1"); there is a 
single dose ("1") of the DHFR mutation but not a double dose ("0") and a single dose 
(T) of the MTHFR mutation but not a double dose ("0"). The number 302088 
identifies the individual whose genotypes are listed; the proband, in this case, is the 
same individual. 
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TABLE 9 

SAS DATAFILE FOR SCHIZOPHRENIA PATIENTS AND CONTROLS 



34 


302086 


1 


1 


0 


1 


1 


35 


302088 


1 


1 


0 


1 


0 


36 


302110 


1 


1 


0 


1 


0 


37 


302111 


1 


1 


0 


0 


0 


38 


302136 


1 


1 


1. 


1 


0 


39 


100001 


0 


1 


0 


0 


0 


40 


100061 


0 


0 


0 


0 


0 


41 


100064 


0 


1 


0 


1 


0 


42 


100067 


0 


0 


0 


1 


0 


43 


100073 


0 


1 


0 


0 


0 



In Table 10, entries 31-40 are shown for the data file containing genotypes of 35 
20 mothers of schizophrenic probands plus (the same) 21 1 control probands. For 

individual 302083, the proband is affected ("1"); there is a single dose of the DHFR 
mutation ("1) but not a double dose ("0"); there is neither a single ("0") nor a double 
("0") dose of the MTHFR mutation. The number 302083 identifies the individual 
whose genotypes are listed, a mother; the proband, in this case, is a different 
25 individual, her affected child. 
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TABLE 9 

SAS DATAFILE FOR SCHIZOPHRENIA PATIENTS AND CONTROLS 



34 


302086 


1 


1 


0 


1 


1 


35 


302088 


1 


1 


0 


1 


0 


36 


302110 


1 


1 


0 


1 


0 


37 


302111 


1 


1 


0 


0 


0 


38 


302136 


1 


1 


1 


1 


0 


39 


100001 


0 


1 


0 


0 


0 


40 


100061 


0 


0 


0 


0 


0 


41 


100064 


0 


1 


0 


1 


0 


42 


100067 


0 


0 


0 


1 


0 


43 


100073 


0 


1 


0 


0 


0 



In Table 10, entries 31-40 are shown for the data file containing genotypes of 35 
20 mothers of schizophrenic probands plus (the same) 21 1 control probands. For 

individual 302083, the proband is affected ("I"); there is a single dose of the DHFR 
mutation ("1) but not a double dose ("0"); there is neither a single ("0") nor a double 
("0") dose of the MTHFR mutation. The number 302083 identifies the individual 
whose genotypes are listed, a mother; the proband, in this case, is a different 
25 individual, her affected child. 
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TABLE 10 

SAS DATAFILE FOR SCHIZOPHRENIA MOTHERS AND CONTROLS 



31 


302083 


1 


1 


0 


0 


0 


32 


302103 


1 


0 


0, 


1 


0 


33 


302104 


1 


0 


0 


1 


0 


34 


302105 


1 


1 ' 


0 


1 


0 


35 


302120 


1 


0 


0 


0 


0 


36 


100001 . 


0 


1 


0 


0 


0 


37 


100061 


0 


0 


0 


0 


0 


38 


100064 


0 


1 


0 


1 


0 


39 


100067 


0 


0 


0 


1 


0 


40 


100073 


0 


1 


0 


0 


0 



In Table 1 1 , entries 1 1-20 are shown for the data file containing genotypes of 15 
fathers of schizophrenic probands plus (the same) 21 1 control probands. For 
individual 302084, the proband is affected ('T*); there is a single dose ("1") but not a 
double dose ("0") of the DHFR mutation; there is both a single ("1°) and a double 
dose ("1") of the MTHFR mutation. The number 302084 identifies the individual 
whose genotypes are listed, a father; the proband, in this case, is a different 
individual, his affected child. 



WO 00/71754 PCT/US00/14354 

81 

TABLE 10 

SAS DATAFILE FOR SCHIZOPHRENIA MOTHERS AND CONTROLS 



31 


302083 


1 


1 


0 


0 


0 


32 


302103 


1 


0 


0 


1 


0 


33 


302104 


1 


0 


0 


1 


0 


34 


302105 


1 


1 


0 


1 


0 


35 


302120 


1 


0 


0 


0 


0 


36 


100001 


0 


1 


0 


0 


0 


37 


100061 


0 


0 


0 


0 


0 


38 


100064 


0 


1 


0 


1 


0 


39 


100067 


0 


0 


0 


1 


0 


40 


100073 


0 


1 


0 


0 


0 



In Table 1 1 , entries 1 1-20 are shown for the data file containing genotypes of 15 
fathers of schizophrenic probands plus (the same) 21 1 control probands. For 
20 individual 302084, the proband is affected (" 1 "); there is a single dose (" 1 ") but not a 
double dose ("0") of the DHFR mutation; there is both a single ("1") and a double 
dose (" 1 ") of the MTHFR mutation. The number 302084 identifies the individual 
whose genotypes are listed, a father; the proband, in this case, is a different 
individual, his affected child. 
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TABLE 1 1 

SAS DATAFILE FOR SCHIZOPHRENIA FATHERS AND CONTROLS 



11 


302102 


1 


0 


0 


0 


0 


12 


302106 


1 


1 


0 


0 


0 


13 


302115 


1 


1 


0 


0 


0 


14 


302117 


1 


1 


0 


0 


0 


15 


302084 


1 


1 


0 


1 


1 


16 


100001 


0 


1 


0 


0 


0 


17 


100061 


0 


0 


0 


0 


0 


18 


100064 


0 


1 


0 


1 


0 


19 


100067 


0 


0 


0 


1 


0 


20 


100073 


0 


1 


0 


0 


0 



In Table 12, entries 9-18 are shown for the data file containing genotypes of 13 
unaffected sibs of schizophrenic probands plus (the same) 211 control probands. For 
20 individual 302089, the proband is affected ("1"); there is a single dose ("1") but not a 
double dose ("0") of the DHFR mutation; there is both a single (" 1 ")■ and a double 
dose ("1") of the MTHFR mutation. The number 302089 identifies the individual 
whose genotypes are listed, an unaffected sib; the proband, in this case, is a different 
individual, the affected sib of individual 302089. 
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TABLE 11 

SAS DATAFILE FOR SCHIZOPHRENIA FATHERS AND CONTROLS 



11 


302102 


1 


0 


0 


0 


0 


12 


302106 


1 


1 


0 


0 


0 


13 


302115 


1 


1 


0 


0 


0 


14 


302117 


1 


1 


0 


0 


0 


15 


302084 


1 


1 


0 


1 


1 


16 


100001 


0 


1 


0 


0 


0 


17 


100061 


0 


0 


0 


0 


0 


18 


100064 


0 


1 


0 


1 


0 


19 


100067 


0 


0 


0 


1 


0 


20 


100073 


0 


1 


0 


0 


0 



In Table 12, entries 9-18 are shown for the data file containing genotypes of 13 
unaffected sibs of schizophrenic probands plus (the same) 21 1 control probands. For 
20 individual 302089, the proband is affected (" 1 "); there is a single dose ( M 1 ") but not a 
double dose ("0") of the DHFR mutation; there is both a single (" 1 ") and a double 
dose ( H 1 H ) of the MTHFR mutation. The number 302089 identifies the individual 
whose genotypes are listed, an unaffected sib; the proband, in this case, is a different 
individual, the affected sib of individual 302089. 



WO 00/71754 



PCT/US00/14354 



83 

TABLE 12 

SAS DATAFILE FOR SCHIZOPHRENIA SIBS AND CONTROLS 



09 


302071 




1 


1 


0 


0 


10 


302073 


1 


0 


0 


1 


0 


11 


302089 


1 


1 


0 


1 


1 


12 


302118 


1 


1 


0 


0 


0 


13 


302126 


1 


1 


0 


0 


0 


14 


100001 


0 


1 


0 


0 


0 


15 


100061 


0 


0 


0 


0 


0 


16 


100064 


0 


1 


0 


1 


0 


17 


100067 


0 


0 


0 


1 


0 


18 


100073 


0 


1 


0 


0 


0 



In Tables 9-12 for individual 100061, the proband is unaffected ("0"); there is neither 
a single dose ("0") nor a double dose ("0") of the DHFR mutation; there is neither a 
single dose ("0") nor a double dose ("0") of the MTHFR mutation. Since the proband 
is unaffected, this is a control individual. The number 100061 identifies the individual 
whose genotypes are listed, as a control individual; the proband, in this case, is the 
same individual. The identical group of control individuals is used for all four 
comparisons. 
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TABLE 12 

SAS DATAHLE FOR SCHIZOPHRENIA SIBS AND CONTROLS 



09 


302071 




1 


1 


0 


0 


10 


302073 


1 


0 


0 


1 


0 


11 


302089 


1 


1 


0 


1 


1 


12 


302118 


1 


1 


0 


0 


0 


13 


302126 


1 


I 


0 


0 


0 


14 


100001 


0 


1 


0 


0 


0 


15 


100061 


0 


0 


0 


0 


0 


16 


100064 


0 


1 


0 


1 


0 


17 


100067 


0 


0 


0 


1 


0 


18 


100073 


0 


1 


0 


0 


0 



In Tables 9-12 for individual 1 00061, the proband is unaffected ("0"); there is neither 
a single dose ("0") nor a double dose ("0") of the DHFR mutation; there is neither a 
single dose ("0") nor a double dose ("0") of the MTHFR mutation. Since the proband' 
is unaffected, this is a control individual. The number 100061 identifies the individual 
whose genotypes are listed, as a control individual; the proband, in this case, is the 
same individual. The identical group of control individuals is used for all four 
comparisons. 
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EXAMPLE 2 

Distribution of Folate Gene Polymorphism Genotypes Among Schizophrenics. 
Schizophrenia Parents. Schizophrenia Sibs. and Controls 

Summary 

5 The DNA polymorphism-Diet-Cofactor-Development hypothesis (DDCD hypothesis, 
described above) postulates that schizophrenia results in part from developmental 
brain damage sustained in utero from the aggregate effect of maternal defects of 
genes related to important cdfactors, e.g. folate, cobalamin, pyridoxins potentiated by 
a maternal dietary deficiency of these cofactors. The maternal damage to the fetus 

10 results in part from insufficiency of these cofactors themselves and in part from 
resulting effects such as immune deficiency and maternal teratogens, e.g. 
hyperhomocysteinemia. Genes from either parent acting in the fetus may modify 
these damaging effects as outlined in the gene-teratogen model (described above). 

The hypothesis addresses all of the unusual biological and epidemiological features of 
1 5 schizophrenia: e.g. the decreased amount of grey matter in brain areas, the unusual 
birth-month effect, the geographical differences in incidence, the socioeconomic 
predilection, the association with obstetrical abnormalities (low birth weight and 
prematurity), the decreased incidence of rheumatoid arthritis, and the association with 
viral epidemics (described above). 

20 The hypothesis can be supported by finding significant association of sequence 
variants of folate, cobalamin, or pyridoxine genes with schizophrenia. Folate, 
cobalamin, and pyridoxine absorption, transport, and metabolism are complex 
[Rosenblatt, In: The Metabolic and Molecular Bases of Inherited Disease, Scriver et 
al. (eds), New York: McGraw-Hill, pp. 3 1 1 1-3 128 (1995); Benton and Rosenberg, In: 

25 The Metabolic and Molecular Bases of Inherited Disease, Scriver et al. (eds), New 
York: McGraw-Hill, pp. 3129-3149 (1995); Whyte et al, Hypophosphatasia, In: The 
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EXAMPLE 2 

Distribution of Folate Gene Polymorphism Genotypes Among Schizophrenics. 
Schizophrenia Parents, Schizophrenia Sibs. and Controls 

Summary 

5 The DNA polymorphism-Diet-Cofactor-Development hypothesis (DDCD hypothesis, 
described above) postulates that schizophrenia results in part from developmental 
brain damage sustained in utero from the aggregate effect of maternal defects of 
genes related to important cofactors, e.g. folate, cobalamin, pyridoxine, potentiated by 
a maternal dietary deficiency of these cofactors. The maternal damage to the fetus 

10 results in part from insufficiency of these cofactors themselves and in part from 
resulting effects such as immune deficiency and maternal teratogens, e.g. 
hyperhomocysteinemia. Genes from either parent acting in the fetus may modify 
these damaging effects as outlined in the gene-teratogen model (described above). 

The hypothesis addresses all of the unusual biological and epidemiological features of 
15 schizophrenia: e.g. the decreased amount of grey matter in brain areas, the unusual 
birth-month effect, the geographical differences in incidence, the socioeconomic 
predilection, the association with obstetrical abnormalities (low birth weight and 
prematurity), the decreased incidence of rheumatoid arthritis, and the association with 
viral epidemics (described above). 

20 The hypothesis can be supported by finding significant association of sequence 
variants of folate, cobalamin, or pyridoxine genes with schizophrenia. Folate, 
cobalamin, and pyridoxine absorption, transport, and metabolism are complex 
[Rosenblatt, In: The Metabolic and Molecular Bases of Inherited Disease, Scriver et 
al. (eds), New York: McGraw-Hill, pp. 31 11-3128 (1995); Benton and Rosenberg, In: 

25 The Metabolic and Molecular Bases of Inherited Disease, Scriver et al (eds), New 
York: McGraw-Hill, pp. 3129-3149 (1995); Whyte et al, Hypophosphatasia, In: The 
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Metabolic and Molecular Bases of Inherited Disease, Scriver et al (eds), New York: 
McGraw-Hill pp. 4095-41 11] with multiple transport proteins, enzymes, and 
regulatory components. A strong candidate for harboring a mutation predisposing to 
schizophrenia is the DHFR gene coding for the folate enzyme dihydrofolate 
5 reductase. DHFR chemically reduces dietary folate converting it into a form that can 
enter cellular metabolism. DHFR is also important for DNA synthesis and is known 
to play a major role in development in utero. A novel polymorphic 19 basepair 
deletion of the DHFR gene has been isolated which could be of functional 
significance because it affects potential transcription factor binding sites. 

10 A second candidate is the MTHFR gene, coding for methylenetetrahydrofolate 
reductase, MTHFR, an important enzyme of folate metabolism. MTHFR was of 
particular interest because severe deficiency of enzyme activity has been associated 
with the "schizophrenia" phenotype [Freeman et al, N. Engl J. Med., 292:491-496 
(1975); Regland et al, J. Neural Transm. Gen. Sect., 98:143-152 (1994)] and because 

1 5 a common mutation, the nt677 C->T transition results in a mutated gene that encodes 
a heat-labile MTHFR, having decreased enzymatic activity, which in the presence of 
dietary folate deficiency, causes the plasma homocysteine of homozygotes to become 
elevated [van der Put et al, Lancet, 346:1070-1071 (1995); Frosst et al, Nature 
Genet, 10:111-113 (1995)]. In adults, hyperhomocysteinemia is known to cause 

20 vascular disease and to be toxic [Frosst et al, Nature Genet, 10: 111-113 (1995)]. 

Therefore, homocysteine that crosses the placenta could act as a fetal teratogen during 
pregnancy. Maternal folate deficiency could also have a more direct teratogenic 
effect through fetal folate deprivation. These effects could be potentiated by 
abnormalities of other folate, cobalamin, or pyridoxine genes, even if these 

25 abnormalities were only minor. 

Materials & Methods : 
1 . Subjects and Sample Collection: Patients with schizophrenia and unaffected family 
members of schizophrenics, were ascertained from patient facilities, patient support 
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Metabolic and Molecular Bases of Inherited Disease, Scriver et al. (eds), New York: 
McGraw-Hill pp. 4095-41 1 1 ] with multiple transport proteins, enzymes, and 
regulatory components. A strong candidate for harboring a mutation predisposing to 
schizophrenia is the DHFR gene coding for the folate enzyme dihydrofolate 
5 reductase. DHFR chemically reduces dietary folate converting it into a form that can 
enter cellular metabolism. DHFR is also important for DNA synthesis and is known 
to play a major role in development in utero. A novel polymorphic 19 basepair 
deletion of the DHFR gene has been isolated which could be of functional 
significance because it affects potential transcription factor binding sites. 

10 A second candidate is the MTHFR gene, coding for methylenetetrahydrofolate 
reductase, MTHFR, an important enzyme of folate metabolism. MTHFR was of 
particular interest because severe deficiency of enzyme activity has been associated 
with the "schizophrenia" phenotype [Freeman et al, N. Engl J. Med. f 292:491-496 
(1975); Regland et al, J. Neural Transm. Gen. Sect., 98:143-152 (1994)] and because 

1 5 a common mutation, the nt677 C->T transition results in a mutated gene that encodes 
a heat-labile MTHFR, having decreased enzymatic activity, which in the presence of 
dietary folate deficiency, causes the plasma homocysteine of homozygotes to become 
elevated [van der Put et al, Lancet., 346: 1 070-1 07 1 (1 995); Frosst et al, Nature 
Genet., 10:111-113 (1995)]. In adults, hyperhomocysteinemia is known to cause 

20 vascular disease and to be toxic [Frosst et al, Nature Genet, 10:1 1 1-1 13 (1995)]. 

Therefore, homocysteine that crosses the placenta could act as a fetal teratogen during 
pregnancy. Maternal folate deficiency could also have a more direct teratogenic 
effect through fetal folate deprivation. These effects could be potentiated by 
abnormalities of other folate, cobalamin, or pyridoxine genes, even if these 

25 abnormalities were only minor. 

Materials & Methods : 
1. Subjects and Sample Collection: Patients with schizophrenia and unaffected family 
members of schizophrenics, were ascertained from patient facilities, patient support 
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groups, and family support group organizations. Nearly all schizophrenia families 
had only a single case of schizophrenia. The patients came from different 
schizophrenia families than the parents and sibs. The controls were unaffected and 
unrelated individuals not known to be schizophrenic or related to patients with 
5 schizophrenia or spina bifida. All subjects were of Caucasian background except two 
of the schizophrenia patients who were of African American background. 

After informed consent was obtained, 20-40 ml of blood was collected into EDTA 
(purple-top) vacutainers, placed on ice immediately, and transported to the laboratory 
where plasma, packed red cells, and buffy coat were separated by centrifugation and 
10 frozen at -80°C. 

2. Detection of Alleles: DNA was isolated using the QIAmp column DNA extraction 
procedure or the QIAGEN Genomic-tip method (QIAGEN, Chats worth, CA). Alleles 
for a newly detected polymorphic 19 bp deletion in the dihydrofolate reductase 
(DHFR) gene were determined by polymerase chain reaction (PCR) amplification of 

15 the region surrounding the deletion using specific primers (Fig 1) and direct detection 
of the PCR products after separation of products on a non-denaturing polyacrylamide 
gel. A Cetus - Perkin-Elmer 9600 thermocycler was used. Briefly, the PCR reaction 
contained 200 uM dNTPs, 1 .5 mM MgCl 2 , 1 0 pmols of each primer, in 1 0 ul reaction 
volume. The PCR conditions used were denaturation at 94°C for 6 min. initially, 

20 followed by 35 cycles of 94°C for 55 sec, 60°C for 55 sec, and 72°C for 55 sec and a 
final extension at 72°C for 1 2 min. 

Alleles for the 677C->T transition of the methylenetetrahydrofolate reductase 
(MTHFR) gene were determined by cleavage with the restriction endonuclease, 
Hinfl, of PCR-amplified genomic DNA from blood and separation of the products by 
25 non-denaturing polyacrylamide gel electrophoresis [Frosst et al f Nature Genet, 
10:111-113 (1995)]. 
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3. Sequencing the Region Around the DHFR Deletion: Using the same primers 
(Figure 1), genomic DNA from individuals with 1,1 and 2,2 genotypes was amplified 
by PCR and the products sequenced using an ABI PRISM 377 automated sequencer. 
Restriction sites were identified using the MAP Program in the GCG Package. 

5 Potential transcription factor binding sites were detected with the TESS program 
(transcription element search software, 
URL:http://agave.humgen.upenn.edu/tess/index.html). 

4. Data Analysis: Since the mode of inheritance of schizophrenia is unknown, binary 
logistic regression was used to test the DHFR deletion allele and the MTHFR 

1 0 heat-labile allele as genetic risk factors for schizophrenia. Either the DHFR deletion 
polymorphism or the MTHFR heat-labile allele could itself be a genetic risk factor for 
schizophrenia. The genotypes of the two folate gene polymorphisms were used as 
explanatory variables. Genotypes of schizophrenia patients, parents, or sibs were 
compared with those of controls. 

1 5 Four files were constructed consisting of schizophrenia patients+controls, mothers of 
schizophrenia patients+controls, fathers of schizophrenia patients+controls, and sibs 
of schizophrenia patients+controls for input into the SAS System. Each dataset 
contained 6 variables. In order, these were: 

1 . six digit identification (ID) number; 
20 2. response variable, i.e. affection status of the proband 

(0=unaffected, Le. control individual; l=affected, Le. schizophrenia patient); 

3. DHFR mutation-single dose (Ds); 

4. DHFR mutation-double dose (Dd); 

5. MTHFR mutation-single dose (Ms); and 
25 6. MTHFR mutation-double dose (Md). 

For mutation data, 0=mutation absent, l=mutation present. 
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Results 

Alleles of the DHFR 19 bp Deletion Polymorphism: Amplification of the region of 
intron 1 of DHFR defined by the primers in Figure 1 gave two polymorphic bands of 
232 and 213 bp after separation on a non-denaturing polyacrylamide gel (Figure 2). 

5 Sequencing the PCR products from the two homozygotes showed that they differed 
by 19 bp (Figure 3). The upper and lower bands (Figure 2), non-deletion allele and 
deletion allele respectively, were designated alleles 1 and 2 respectively. Comparison 
with two published sequences showed that allele 1 was identical with one of them 
[Yang et al J. Mol Biol 176:169-187 (1984)] indicating that allele 2 resulted from a 

10 19 bp deletion. The other published sequence [Chen et al 1 Biol Chem. 259:3933- 
3943 (1 984)] was lacking one base pair of allele 1, an A indicated by "*" in Fig 3. It 
is possible that this shorter reference sequence [Chen et al J. Biol Chem, 259:3933- 
3943 (1984)] resulted from a sequencing artifact. 

Sequences in the 19 bp Deleted Region of DHFR Intron 1: The 19bp sequence in the 
15 deleted region (Fig 3) of DHFR intron 1 contained sites for several restriction 

enzymes including Rsal and ScrFI, and potential binding sites for transcription factors 
including Sp 1 ,-NF-kappaB, CP 1 (NF-Y), E2F, ETF and GCF in the 1 9 base pair 
region. 

Binary Logistic Regression Analysis: The number of individuals with each genotype 
20 of the two polymorphisms among 38 unrelated schizophrenia probands, 35 unrelated 
mothers of schizophrenia probands, 15 unrelated fathers of schizophrenia probands, 
13 unrelated unaffected sibs of schizophrenia probands, and 21 1 unrelated unaffected 
control probands is shown in Table 13. 



WO 00/71754 



PCT/USOO/14354 



88 
Results 

Alleles of the DHFR 19 bp Deletion Polymorphism: Amplification of the region of 
intron 1 of DHFR defined by the primers in Figure 1 gave two polymorphic bands of 
232 and 213 bp after separation on a non-denaturing polyacrylamide gel (Figure 2). 

5 Sequencing the PCR products from the two homozygotes showed that they differed 
by 19 bp (Figure 3). The upper and lower bands (Figure 2), non-deletion allele and 
deletion allele respectively, were designated alleles 1 and 2 respectively. Comparison 
with two published sequences showed that allele 1 was identical with one of them 
[Yang et al. J. Mol Biol 176:169-187 (1984)] indicating that allele 2 resulted from a 

10 19 bp deletion. The other published sequence [Chen et al J. Biol Chem. 259:3933- 
3943 (1984)] was lacking one base pair of allele 1, an A indicated by "*" in Fig 3. It 
is possible that this shorter reference sequence [Chen et al J. Biol Chem. 259:3933- 
3943 (1984)] resulted from a sequencing artifact. 

Sequences in the 19 bp Deleted Region of DHFR Intron 1: The 19bp sequence in the 
1 5 deleted region (Fig 3) of DHFR intron 1 contained sites for several restriction 

enzymes including Rsal and ScrFI, and potential binding sites for transcription factors 
including Spl, NF-kappaB, CP1 (NF-Y), E2F, ETF and GCF in the 19 base pair 
region. 

Binary Logistic Regression Analysis: The number of individuals with each genotype 
20 of the two polymorphisms among 38 unrelated schizophrenia probands, 35 unrelated 
mothers of schizophrenia probands, 15 unrelated fathers of schizophrenia probands, 
13 unrelated unaffected sibs of schizophrenia probands, and 21 1 unrelated unaffected 
control probands is shown in Table 13. 



WO 00/71754 



PCT/US00/14354 



89 

TABLE 13 

DISTRIBUTION OF DHFR AND MTHFR MUTATION GENOTYPES 
AND ALLELES AMONG CONTROLS. SCHIZOPHRENICS. 
AND SCHIZOPHRENIA FAMILY MEMBERS 

5 DHFR 19 bp deletion polymorphism : 

-GenTyp— — — —Schizophrenia — — Ctrl — 

P M F S 

1/1 6 (.16) 10 (.29) 4 (.27) 4 (.31) 56 (.26) 

1/2 22 (.58) 13 (.37) 11 (.73) 8 (.61) 115 (.54) 

10 2/2 10 (.26) 12 (.34) 0 (0.0) 1 (.08) 40 (.19) 

total 38(1.00) 35(1.00) 15(1.00) 13 (1.00) 211 (.99) 

MTHFR 677C->T transition polymorphism : 

-GenTyp - — —Schizophrenia —Ctrl — 

P M F S 

15 1/1 14 (.37) 16 (.46) 11 (.73) 4 (.31) 103 (.49) 

1/2 18 (.47) 18 (.51) 3 (.20) 8 (.61) 78 (.37) 

2/2 6 (.16) 1 (.03) 1 (.07) 1 (.08) 30 (.14) 

total 38(1.00) 35(1.00) 15(1.00) 13(1.00) 211 (1.00) 



P=schizophrenia patients; M=mothers of schizophrenia patients; F=fathers of 
20 schizophrenia patients; S=unaffected sibs of schizophrenia patients; Ctrl=control 
individuals. 
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The four data files were analyzed using the logistic procedure of SAS (SAS Institute 
Inc., 1995) and the "descending" option, which modeled the probability that 
RESPONSE=l, that is, the probability that the proband was affected with 
schizophrenia. Note that the proband was not always the individual whose genotype 
data were used. For example, genotype data for mothers of schizophrenic probands 
were used to determine the probability that their children, the probands, were affected, 
Use of the "best" model selection options for logistic analysis in SAS gave the best 
models for two and three explanatory variables, (Table 14). 
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Table 14 

BINARY LOGISTIC REGRESSION RESULTS 

GENETIC RISK FACTOR MODEL: DsDd Ms Md 
Odds Ratio (p value) 

Schizophrenia Patients 

DsOR(p) 1.937 (.18) 

DdOR(p) 1.263 (.59) 

MsOR(p) 1.775 (.14) 

MdOR(p) 0.914 (.86) 



Mothers of Schizophrenia Patients 
DsOR(p) 0.630 (.31) 

Dd OR(p) 2.653 (.028) 

MsOR(p) 1.439 (.34) 

MdOR(p) 0.143 (.065) 



* 



Fathers of Schizophrenia Patients 
DsOR(p) 1.178 (.79) 

Dd OR(p) 0.000 (.96) 

MsOR(p) 0.366 (.14) 

Md OR(p) 0.841 (.88) 

Unaffected Sibs of Schizophrenia Patients 
DsOR(p) 1.104 (.88) 

Dd OR(p) 0.337 (.31) 

MsOR(p) 2.688 (.12) 

MdOR(p) 0.317 (.29) 
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Notes For Table 14 

DHFR 19 bp deletion : Ds=single dose; Dd=double dose 

MTHFR 677C->T mutation : Ms=single dose; Md=double dose 

Logistic regression model: 

Model with four explanatory variables (Ms, Md, Ds and Dd). 

OR(p)=odds ratio and the corresponding p-value for that odds ratio 
determination *=significant at the p<.05 level. 

0.000 odds ratios occurred since none of the fathers of schizophrenia patients 
had genotype Dd; there was a possibly quasi- complete separation in the sample 
points; the maximum likelihood estimate may not exist; and therefore validity of the 
model fit for these odds ratios was questionable. 

The comparison of mothers of schizophrenia probands with control probands was 
statistically significant. Ds was not a significant genetic risk factor. Neither Ms nor 
Md in mothers was a significant genetic risk factor. However, the p-value for Md 
decreased and approached significance (p= 065) at the p<.05 level. 

5 Predicted Probabilities of the Various Genotypes: The "probs predicted" modality of 
SAS, gave the predicted probability that the proband was affected with schizophrenia 
(response-1) given genotype data for control probands and schizophrenia patients 
(probands), mothers of schizophrenia probands, fathers of schizophrenia probands, or 
sibs of schizophrenia probands. The maximum probabilities obtained are shown in 

10 Table 15. The highest maximum predicted probability that the proband was affected 
was obtained for genotype data from mothers of schizophrenia probands, next for 
schizophrenia probands, next for fathers of schizophrenia probands, and lowest for 
sibs of schizophrenia probands. 
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TABLE 15 

MAXIMUM PREDICTED PROBABILITY 

Model P M E S 

DsDdMsMd0.24 0.29 0.12 0.11 

Model and explanatory variables are the same as in Table 14. 
********************************^ 

Determination of Genotypes Conferring the Highest Risk: The predicted probabilities 
that the proband was affected with schizophrenia given specific genotypes of control 
probands and schizophrenia probands, mothers of schizophrenia probands, fathers of 
schizophrenia probands, or sibs of schizophrenia probands were determined using the 

5 model containing all four explanatory variables (Table 16). The predicted 
probabilities that the proband was affected with schizophrenia were highest for 
maternal genotypes (Table 1 5). The maternal genotype with the highest risk was Dd 
Ms, conferring a probability of 0.29 of schizophrenia in the proband (Table 1 6). The 
Dd Ms genotype also gave the highest predicted probability, 0.24, for schizophrenia 

10 patients. 
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TABLE 16 

PREDICTED PROBABILITIES FOR SPECIFIC GENOTYPES 



Model: Ds Dd Ms Md 



Genotyp e 



Predicted 
Probability 



Schizophrenia Patients : 

Dnull + Mnull 0.07 

Dnull + Ms 0.12 

Dnull + Md 0.11 

Ds + Mnull 0.12 

Dd + Mnull 0.15 

Mothers of Schizophrenia Patients : 

Dnull + Mnull 0.16 

Dnull + Ms 0.20 

Dnull + Md 0.03 

Dd + Mnull 0.22 

Ds + Mnull 0.10 

Fathers of Schizophrenia Patients : 

Dnull + Mnull 0.10 

Dnull + Ms 0.04 

Dnull + Md 1 0.03 

Ds + Mnull 0.12 

Dd + Mnull 0.0 

Unaffected Sibs of Schizophrenia Patients : 

Dnull + Mnull 0.04 

Dnull + Ms 0.10 

Dnull + Md 0.03 

Ds + Mnull 0.04 

Dd + Mnull 0.02 



Genotype 



Ds +Ms 

Ds +Md 

Dd + Ms 

Dd +Md 



Ds .+ Ms 
Ds +Md 
Dd +Ms 
Dd +Md 



Ds + Ms 

Ds + Md 

Dd +Ms 

Dd +Md 



Ds + Ms 

Ds +Md 

Dd +Ms 

Dd +Md 



Predicted 
Probability 

0.20 
0.19 
0.24 
0.23 



0.13 
0.02 
0.29 
0.06 



0.05 
0.04 
0.0 
0.0 



0.11 
0.04 
0.04 
0.01 



Genotypes consist of the same explanatory variables described in Table 14 except that Dnull has no 
copy of the DHFR deletion and Mnull has no copy of the MTHFR 677C->T variant. Odds ratios of 
0.0 were unsatisfactory as described in Table 14. 
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Discussion 

Structure and Function of the DHFR 19 bp Deletion Polymorphism: DHFR 
polymorphisms have been reported previously [Feder et al. y Nucl Acids Res, 15:5906 
(1987); Detera-Wadleigh et al, Nucl Acids Res. 17:6432 (1989)]. It is known that 

5 introns are important for message regulation e.g., splicing, or as sites for binding 
transcription factors. Since the first intron is a relatively common location for 
regulatory elements, it is possible that the deleted region of DHFR intron 1 could play 
a role in regulation of DHFR or that the deletion could be a genetic risk factor for 
schizophrenia because it removes potential transcription factor binding sites. 

1 0 Abnormalities of transcription factors and their binding sites may play a role in 

disease. For example, a polymorphic Spl binding site in the collagen type I alpha 1 
gene has been associated with reduced bone density and osteoporosis [Grant et al., 
Nature Genet. 14:203-205 (1996)]. 

The Nature of the Putative Folate Genetic Risk Factors for Schizophrenia: Dd in the 
1 5 mother of a schizophrenia proband conferred significantly increased risk of 

schizophrenia in her child (Table 14). The findings that Dd was a genetic risk factor 
in mothers but not fathers of schizophrenia probands (Table 15) and that Dd in 
mothers gave a higher predicted probability than in schizophrenia patients, fathers or 
sibs (Tables 15 and 16) was consistent with the role of DHFR as a teratogenic locus 
20 according to the gene-teratogen model (described above). The finding that a double 
dose but not a single dose of the DHFR deletion in mothers was a genetic risk factor 
(Table 16) supported a recessive mode of action in the mother. A teratogenic locus 
acting in the mother can also act as a modifying or specificity locus in the fetus. 

Neither Ms nor Md in mothers of schizophrenia probands showed statistical 
25 significance as genetic risk factors for schizophrenia in probands (Table 14). However 
Md in mothers approached statistical significance (p=. 065) and appeared to be 
protective (odds ratio 0.14), while Ms in mothers appeared to increase risk modestly 
(odds ratio 1 .44, p=.34). 
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mothers gave a higher predicted probability than in schizophrenia patients, fathers or 
sibs (Tables 15 and 16) was consistent with the role of DHFR as a teratogenic locus 
20 according to the gene-teratogen model (described above). The finding that a double 
dose but not a single dose of the DHFR deletion in mothers was a genetic risk factor 
(Table 16) supported a recessive mode of action in the mother. A teratogenic locus 
acting in the mother can also act as a modifying or specificity locus in the fetus. 

Neither Ms nor Md in mothers of schizophrenia probands showed statistical 
25 significance as genetic risk factors for schizophrenia in probands (Table 14). However 
Md in mothers approached statistical significance (p=. 065) and appeared to be 
protective (odds ratio 0.14), while Ms in mothers appeared to increase risk modestly 
(odds ratio 1.44, p=.34). 
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Role of Genetic and Environmental Factors in Schizophrenia: Since the probability 
that a schizophrenia co-twin is also affected is reported [Gottesman, Schizophrenia 
Genesis, Schizophrenia Genesis- The Origins of Madness, W.H. Freeman & Co. 
N.Y.(1991)] to be only 48%, a large part of the risk for schizophrenia would be 

5 anticipated to come from environmental factors. Therefore, some controls should 
have the genetic risk factors for schizophrenia but not be affected with schizophrenia. 
In the present data set, 6 of 35 schizophrenia mothers and 7 of 38 schizophrenia 
patients had Dd Ms, the genotype conferring the highest risk, compared with 1.5, of 
21 1 controls. Since this genotype gave predicted probabilities of schizophrenia in 

1 0 probands of 0.29 and 0.24 respectively, polymorphisms of DHFR and MTHFR could 
account for a considerable portion of the genetic component of the risk of 
schizophrenia. 

Relation of DHFR to Cytogenetic and Linkage Data for Schizophrenia: As discussed 
above, the DHFR gene has been located on chromosome 5 at 5ql 1.2-13.2. A 

1 5 schizophrenia translocation was reported (McGilli vray et al . 1 990; Bassett, 1 992) 
affecting 5qll.2-5ql3.3. Also two-point lod scores of 4.64 and 2.29 were found 
[Sherrington et al, Nature, 336:164-167 (1988)] for the polymorphic markers D5S76 
and D5S39 respectively on chromosome 5, in this region [McGillivray et al, Am. J. 
Med Genet., 35:10-13 (1990); Bassett, Br. 1 Psychiatry, 161:323-334 (1992)]. Two 

20 other linkage studies found small positive lod scores in this region [Coon et al. t Biol 
Psychiatry, 34:277-289 (1993); Kendler and Diehl, Schizophr. Bull, 19:261-285 
(1993)], but numerous other studies excluded this region under the assumptions and 
models used [Kendler and Diehl, Schizophr. BulL 19:261-285 (1993)]. Recently, 
new studies have found suggestive evidence for a potential susceptibility locus at a 

25 different region of 5q, 5q31 [Schwab et al, Nat, Genet. 11:325-327 (1997)] and 
5q22-31 [Straub et al, Molec Psychiatr. 2:148-1 55 (1997)]. 

The case-control study presented herein illustrates the usefulness of the DNA 
polymorphism-Diet-Cofactor-Development and the gene-teratogen models described 
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above. More importantly, the results presented herein, clearly fail to reject the specific 
models, i.e., that folate gene polymorphisms can play a role in the etiology of 
schizophrenia. 

The present invention is not to be limited in scope by specific embodiments described 
herein. Indeed, various modifications of the invention in addition to those described 
herein will become apparent to those skilled in the art from the foregoing description 
and the accompanying figures. Such modifications are intended to fall within the 
scope of the appended claims. 

Various publications in addition to the immediately foregoing are cited herein, the 
disclosures of which are incorporated by reference in their entireties. 
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We Claim: 

1 . A method of generating a genetic reference dataset for use in the 
determination of the predicted probability for an individual of having a susceptibility 
for a developmental disorder due to genetic factors or for developing a developmental 
5 disorder due to genetic factors or for having offspring that develop a developmental 
disorder due to genetic factors comprising: 

(a) collecting a biological sample from a human subject; wherein the 
human subject is selected from the group consisting of a diagnostic proband, a blood 
relative of the diagnostic proband, an affected proband, a blood relative of the 

1 0 affected proband, a control proband, and a blood relative of the control proband; 

wherein the biological sample contains nucleic acids and/or proteins from the human 
subject; 

(b) analyzing the nucleic acids and/or proteins from the biological sample; 
wherein said analyzing results in a partial or full genotype for the alleles of the genes 

15 involved in folate, pyridoxine, and/or cobalamin metabolism; wherein said partial or 
full genotype forms a dataset of genetic explanatory variables for the human subject; 
and 

(c) compiling the dataset of genetic explanatory variables from multiple 
human subjects into a genetic reference dataset. 

20 2. A method of generating a genetic and environmental reference dataset for use 
in the determination of the predicted probability for an individual of having a 
susceptibility for a developmental disorder due to genetic factors and environmental 
factors or for developing a developmental disorder due to genetic factors and 
environmental factors or for having offspring that develop a developmental disorder 

25 due to genetic factors and environmental factors comprising: 

(a) obtaining dietary and epidemiological information for environmental 
explanatory variables for the human subjects of Claim 1 ; and 
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(b) combining said environmental explanatory variables with a genetic 
reference dataset for the human subjects. 



3. The method of Claim 2 wherein the developmental disorder is selected from 
the group consisting of schizophrenia, spina bifida cystica, Tourette's syndrome, 

5 dyslexia, conduct disorder, attention-deficit hyperactivity disorder, bipolar illness, 
autism, chronic multiple tic syndrome and obsessive-compulsive disorder. 

4. A method of generating an environmental reference dataset for use in the 
determination of the predicted probability for an individual of having a susceptibility 
for a developmental disorder due to environmental factors or for developing a 

10 developmental disorder due to environmental factors or for having offspring that 
develop a developmental disorder due to environmental factors comprising: 

(a) obtaining dietary and epidemiological information for environmental 
explanatory variables for a human subject; wherein the human subject is selected 
from the group consisting of a diagnostic proband, a blood relative of the diagnostic 

1 5 proband, an affected proband, a blood relative of the affected proband, a control 
proband, and a blood relative of the control proband; and 

(b) compiling a dataset of environmental explanatory variables from 
multiple human subjects into an environmental reference dataset for the human 
subjects. 

20 5 . A method of estimating the genetic susceptibility of an individual to have or 

to develop a developmental disorder comprising: 

(a) collecting a biological sample from one or more participants; wherein 

a participant is either the individual or a blood relative of the individual; and wherein 

the biological sample contains nucleic acids and/or proteins of the participant; 
25 (b) analyzing the nucleic acids and/or proteins from the biological sample; 

wherein said analyzing results in a partial or full genotype for the alleles of the genes 
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(b) combining said environmental explanatory variables with a genetic 
reference dataset for the human subjects. 



3. The method of Claim 2 wherein the developmental disorder is selected from 
the group consisting of schizophrenia, spina bifida cystica, Tourette's syndrome, 

5 dyslexia, conduct disorder, attention-deficit hyperactivity disorder, bipolar illness, 
autism, chronic multiple tic syndrome and obsessive-compulsive disorder. 

4. A method of generating an environmental reference dataset for use in the 
determination of the predicted probability for an individual of having a susceptibility 
for a developmental disorder due to environmental factors or for developing a 

1 0 developmental disorder due to environmental factors or for having offspring that 
develop a developmental disorder due to environmental factors comprising: 

(a) obtaining dietary and epidemiological information for environmental 
explanatory variables for a human subject; wherein the human subject is selected 
from the group consisting of a diagnostic proband, a blood relative of the diagnostic 

1 5 proband, an affected proband, a blood relative of the affected proband, a control 
proband, and a blood relative of the control proband; and 

(b) compiling a dataset of environmental explanatory variables from 
multiple human subjects into an environmental reference dataset for the human 
subjects. 
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to develop a developmental disorder comprising: 

(a) collecting a biological sample from one or more participants; wherein 

a participant is either the individual or a blood relative of the individual; and wherein 

the biological sample contains nucleic acids and/or proteins of the participant; 
25 (b) analyzing the nucleic acids and/or proteins from the biological sample; 

wherein said analyzing results in a partial or full genotype for the alleles of the genes 
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involved in folate, pyridoxine, and/or cobalamin metabolism; and wherein said partial 
or full genotype forms a dataset of genetic explanatory variables for the participants; 

(c) adding the datasets of genetic explanatory variables obtained from 
steps (a) and (b) to a genetic reference dataset therein forming a combined genetic 

5 dataset; 

(d) formulating a model comprising the genetic explanatory variables 
obtained from the participants; and 

(e) analyzing the combined genetic dataset; wherein a predicted 
probability for the individual of having or developing a developmental disorder is 

10 determined; and wherein the genetic susceptibility of an individual to have or to 
develop a developmental disorder is estimated. 

6. The method of Claim 5 wherein said analyzing the combined genetic dataset 
is performed by binary linear regression. 

7. . The method of Claim 6 further comprising the step of : 

1 5 (f) modifying the model by adding or subtracting a genetic explanatory 

variable; and re-analyzing the combined genetic dataset by binary logistic regression; 
wherein a model is chosen that best fits the data. 

8. The method of Claim 7 further comprising the step of : 
(g) testing the model for goodness of fit. 

20 9. The method of Claim 8 wherein the binary linear regression is performed with 
the SAS system. 

10. The method of Claim 5 wherein the developmental disorder is selected from 
the group consisting of schizophrenia, spina bifida cystica, Tourette's syndrome, 
dyslexia, conduct disorder, attention-deficit hyperactivity disorder, bipolar illness, 
25 autism, chronic multiple tic syndrome and obsessive-compulsive disorder. 
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1 1 . The method of Claim 10 wherein the developmental disorder is schizophrenia 
and the individual is suspected of being genetically susceptible of having or for 
developing schizophrenia. 

12. The method of Claim 1 1 wherein the individual is suspected of being 

5 genetically susceptible for having or for developing schizophrenia because a blood 
relative has schizophrenia. 

13. The method of Claim 12 wherein the blood relative is a parent, a sibling, or a 
grandparent. 

14. The method of Claim 1 3 wherein the blood relative is a parent and wherein the 
1 0 parent is the mother of the individual. 

15. A method of estimating the genetic and environmental susceptibility of an 
individual to have or to develop a developmental disorder comprising: 

(a) collecting a biological sample from one or more participants; wherein 
a participant is either the individual or a blood relative of the individual; and wherein 

1 5 the biological sample contains nucleic acids and/or proteins of the participant; 

(b) analyzing the nucleic acids and/or proteins from the biological sample; 
wherein said analyzing results in a partial or full genotype for the alleles of the genes 
involved in folate, pyridoxine, and/or cobalamin metabolism; and wherein said partial 
or full genotype forms a dataset of genetic explanatory variables for the participants; 

20 (c) obtaining dietary and epidemiological information for environmental 

explanatory variables for the participants; wherein said information forms a dataset of 
environmental explanatory variables for the participants; 

(d) adding the datasets of genetic explanatory variables obtained from 
steps (a) and (b) and the dataset of environmental explanatory variables of step (c) to 

25 a genetic and environmental reference dataset therein forming a combined genetic and 
environmental dataset; 

(e) formulating a model comprising the genetic and environmental 
explanatory variables obtained from the participants; and 
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(f) analyzing the combined genetic and environmental dataset by binary 
logistic regression; 

wherein a predicted probability for the individual of having or developing a 
developmental disorder is determined; and wherein the genetic and environmental 
5 susceptibility of an individual to have or to develop a developmental disorder is 
estimated. 

16. The method of Claim 15 further comprising the step of : 

(g) modifying the model by adding or subtracting a genetic or 
environmental explanatory variable; and re-analyzing the combined genetic and 

1 0 environmental dataset by binary logistic regression; wherein a model is chosen that 
best fits the data. 

1 7. The method of Claim 1 6 further comprising the step of : 

(h) testing the model for goodness of fit. 

18. The method of Claim 1 7 wherein the binary linear regression is performed 
15 with the SAS system. 

19. A method of estimating the susceptibility of an individual to have offspring 
that develop a developmental disorder comprising: 

(a) collecting a biological sample from one or more participants; wherein 
a participant is either the individual or a blood relative of the individual; and wherein 

20 the biological sample contains nucleic acids and/or proteins of the participant; 

(b) analyzing the nucleic acids and/or proteins from the biological sample; 
wherein said analyzing results in a partial or full genotype for the alleles of the genes 
involved in folate, pyridoxine, and/or cobalamin metabolism; and wherein said partial 
or full genotype forms a dataset of genetic explanatory variables for the participants; 

25 (c) adding the datasets of genetic explanatory variables obtained from 

steps (a) and (b) to a genetic reference dataset therein forming a combined genetic 
dataset; 
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(f) analyzing the combined genetic and environmental dataset by binary 
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steps (a) and (b) to a genetic reference dataset therein forming a combined genetic 
dataset; 



WO 00/71754 



PCT/US00/14354 



103 

(d) formulating a model comprising the genetic explanatory variables 
obtained from the participants; and 

(e) analyzing the combined genetic dataset by binary logistic regression; 
wherein a predicted probability for the individual to have offspring that 

develop a developmental disorder is determined; and wherein the genetic and 
environmental susceptibility of an individual to have offspring that develop a 
developmental disorder is estimated. 

20. The method of Claim 1 9 further comprising the step of : 

(f) modifying the model by adding or subtracting a genetic explanatory 
variable; and re-analyzing the combined genetic dataset by binary logistic regression; 
wherein a model is chosen that best fits the data. 

2 1 . The method of Claim 20 further comprising the step of : 

(g) testing the model for goodness of fit. 

22. The method of Claim 2 1 wherein the binary linear regression is performed 
with the SAS system. 

23. The method of Claim 22 wherein the individual is a pregnant woman. 



24. A method of lowering the risk of a pregnant woman who has been determined 
by the method of Claim 23 to be susceptible to have offspring that develop a 
developmental disorder comprising administering methylfolate, cobalamin or 
pyridoxine to the pregnant woman, wherein said administering lowers the risk of the 
pregnant woman of giving birth to offspring with a developmental disorder. 

25. A method of determining if any treatment is advisable for a pregnant woman 
who has been determined by the method of Claim 23 to be susceptible to having 
offspring that develop a developmental disorder comprising determining the 
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concentration of a risk factor from a tissue sample or body fluid from the pregnant 
woman; wherein when the concentration of the risk factor is statistically above or 
below an accepted normal range, treatment is advisable. 

26. A method of monitoring the effect of the administration of methylfolate, 

5 cobalamin or pyridoxine to the pregnant woman of Claim 25, comprising determining 
* the concentration of a risk factor from a tissue sample or body fluid from the pregnant 
woman; and wherein when the concentration of the risk factor is statistically within 
an accepted normal range, the treatment is effective. 

27. The method of Claim 26 wherein the risk factor is selected from the group 
1 0 consisting of homocysteine, folate, and cobalamin. 

28. The method of Claim 22 wherein the individual is the mate of a pregnant 
woman. 

29. A method of treating an asymptomatic individual determined by the method of 
Claim 23 to be susceptible for developing a developmental disorder comprising 

1 5 administering methylfolate, cobalamin or pyridoxine. 

30. An isolated nucleic acid encoding a genetic variant of human dihydrofolate 
reductase comprising a nucleotide sequence having a 19 base-pair deletion spanning 
nucleotides 540 to 558 of the nucleotide sequence of SEQ ID NO:41. 

3 1 . The isolated nucleic acid of Claim 30 that has the nucleotide sequence of SEQ 
20 IDNO:42. 

32. An expression vector comprising the nucleic acid of Claim 30 operably 
associated with an expression control sequence, wherein the nucleic acid is selected 
from the group consisting of cDNA or genomic DNA. 



WO 00/71754 



PCT/USOO/14354 



104 

concentration of a risk factor from a tissue sample or body fluid from the pregnant 
woman; wherein when the concentration of the risk factor is statistically above or 
below an accepted normal range, treatment is advisable. 

26. A method of monitoring the effect of the administration of methylfolate, 

5 cobalamin or pyridoxine to the pregnant woman of Claim 25, comprising determining 
the concentration of a risk factor from a tissue sample or body fluid from the pregnant 
woman; and wherein when the concentration of the risk factor is statistically within 
an accepted normal range, the treatment is effective. 

27. The method of Claim 26 wherein the risk factor is selected from the group 
10 consisting of homocysteine, folate, and cobalamin. 

28. The method of Claim 22 wherein the individual is the mate of a pregnant 
woman. 

29. A method of treating an asymptomatic individual determined by the method of 
Claim 23 to be susceptible for developing a developmental disorder comprising 

1 5 administering methylfolate, cobalamin or pyridoxine. 

30. An isolated nucleic acid encoding a genetic variant of human dihydrofolate 
reductase comprising a nucleotide sequence having a 19 base-pair deletion spanning 
nucleotides 540 to 558 of the nucleotide sequence of SEQ ID NO:41. 

3 1 . The isolated nucleic acid of Claim 30 that has the nucleotide sequence of SEQ 
20 IDNO:42. 

32. An expression vector comprising the nucleic acid of Claim 30 operably 
associated with an expression control sequence, wherein the nucleic acid is selected 
from the group consisting.of cDNA or genomic DNA. 



WO 00/71754 



PCT/USOO/14354 



105 

33. A PCR primer that can be used to distinguish SEQ ID NO:42 from the 
nucleotide sequence selected from the group consisting of SEQ ID NO:41 and SEQ 
IDNO:45. 

34. The PCR primer of Claim 33 that comprises 1 0 to 50 consecutive nucleotides 
5 from the nucleotide sequence selected from the group of SEQ ID NO: 41 , the 

complementary strand of SEQ ID NO: 41 , SEQ ID NO:42 v the complementary strand 
of SEQ ID NO: 42, SEQ ID NO.45, and the complementary strand of SEQ ID NO: 
45. 

35. The PCR primer of Claim 34 wherein the 1 0 to 50 consecutive nucleotides are 
10 from nucleotides 350 to 530 of SEQ ID NO:41. 

36. The PCR primer of Claim 35 having the nucleotide sequence of 5'-CTA AAC 
TGC ATC GTC GCT GTG-3' (SEQ ID NO:38). 

37. The PCR primer of Claim 36 wherein the 1 0 to 50 consecutive nucleotides are 
from the complementary strand of nucleotides 550 to 850 of SEQ ID NO:41 . 

15 38. The PCR primer of Claim 37 having the nucleotide sequence of 5'- AAA AGG 
GGA ATC CAG TCG G-3* (SEQ ID NO:39). 

39. An isolated nucleic acid that hybridizes under standard hybridization 
conditions to a nucleic acid having the nucleotide sequence 

ACCTGGGCGGGACGCGCCA (SEQ ID NO:40) or a sequence complementary to 
20 SEQ ID NO:40; wherein said isolated nucleic acid consists of 12 to 48 nucleotides. 

40. An isolated nucleic acid that hybridizes to the nucleotide sequence of SEQ ID 
NO:42, but not to the nucleotide sequence of SEQ ID NO:41; when said hybridizing 
is performed under identical conditions. 
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33. A PCR primer that can be used to distinguish SEQ ID NO:42 from the 
nucleotide sequence selected from the group consisting of SEQ ID NO:41 and SEQ 
IDNO:45. 

34. The PCR primer of Claim 33 that comprises 10 to 50 consecutive nucleotides 
5 from the nucleotide sequence selected from the group of SEQ ID NO: 41 , the 

complementary strand of SEQ ID NO: 41 , SEQ ID N0.42, the complementary strand 
of SEQ ID NO: 42, SEQ ID N0.45, and the complementary strand of SEQ ID NO: 
45. 

35. The PCR primer of Claim 34 wherein the 10 to 50 consecutive nucleotides are 
10 from nucleotides 350 to 530 of SEQ ID NO:41. 

36. The PCR primer of Claim 35 having the nucleotide sequence of 5'-CTA AAC 
TGC ATC GTC GCT GTG-3' (SEQ ID NO:38). 

37. The PCR primer of Claim 36 wherein the 10 to 50 consecutive nucleotides are 
from the complementary strand of nucleotides 550 to 850 of SEQ ID NO:41. 

15 38. The PCR primer of Claim 37 having the nucleotide sequence of 5-AAA AGG 
GGA ATC CAG TCG G-3 1 (SEQ ID NO:39). 

39. An isolated nucleic acid that hybridizes under standard hybridization 
conditions to a nucleic acid having the nucleotide sequence 

ACCTGGGCGGGACGCGCCA (SEQ ID NO:40) or a sequence complementary to 
20 SEQ ID NO:40; wherein said isolated nucleic acid consists of 12 to 48 nucleotides. 

40. An isolated nucleic acid that hybridizes to the nucleotide sequence of SEQ ID 
NO:42, but not to the nucleotide sequence of SEQ ID NO:41 ; when said hybridizing 
is performed under identical conditions. 
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41 . An isolated nucleic acid that hybridizes to the complementary strand of the 
nucleotide sequence of SEQ ID NO:42, but not to the complementary strand of the 
nucleotide sequence of SEQ ID NO:41; when said hybridizing is performed under 
identical conditions. 

5 42. An isolated nucleic acid that hybridizes to the nucleotide sequence of SEQ ID 
NO:41, but not to the nucleotide sequence of SEQ ID NO:42; when said hybridizing 
is performed under identical conditions. 

43. An isolated nucleic acid that hybridizes to the complementary strand of the 
nucleotide sequence of SEQ ID NO:41, but not to the complementary strand of the 

1 0 nucleotide sequence of SEQ ID NO:42; when said hybridizing is performed under 
identical conditions. 

44. The method of Claim 5 wherein said analyzing the nucleic acids and/or 
proteins from the biological sample comprises determining if the biological sample 
contains a genetic variant of human dihydrofolate reductase having a nucleotide 

1 5 sequence with a 19 base-pair deletion spanning nucleotides 540 to 558 of the 

nucleotide sequence of SEQ ID NO:41; and wherein the genetic variant of human 
dihydrofolate reductase is an explanatory variable. 

45. The method of Claim 44 wherein said determining is performed by a method 
selected from the group consisting of PCR, special PCR, RT PCR, RFLP analysis, 

20 SSCP, and FISH. 

46. The method of Claim 1 wherein said analyzing the nucleic acids and/or 
proteins from the biological sample comprises determining if the biological sample 
contains the genetic variant of human dihydrofolate reductase having a nucleotide 
sequence with a 19 base-pair deletion spanning nucleotides 540 to 558 of the 

25 nucleotide sequence of SEQ ID NO:41 ; and wherein the genetic variant of human 
dihydrofolate reductase is an explanatory variable. 
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41 . An isolated nucleic acid that hybridizes to the complementary strand of the 
nucleotide sequence of SEQ ID NO:42, but not to the complementary strand of the 
nucleotide sequence of SEQ ID NO:41; when said hybridizing is performed under 
identical conditions. 

5 42. An isolated nucleic acid that hybridizes to the nucleotide sequence of SEQ ID 
NO:41, but not to the nucleotide sequence of SEQ ID NO:42; when said hybridizing 
is performed under identical conditions. 

43. An isolated nucleic acid that hybridizes to the complementary strand of the 
nucleotide sequence of SEQ ID NO:41, but not to the complementary strand of the 

10 nucleotide sequence of SEQ ID NO:42; when said hybridizing is performed under 
identical conditions. 

44. The method of Claim 5 wherein said analyzing the nucleic acids and/or 
proteins from the biological sample comprises determining if the biological sample 
contains a genetic variant of human dihydrofolate reductase having a nucleotide 

1 5 sequence with a 19 base-pair deletion spanning nucleotides 540 to 558 of the 

nucleotide sequence of SEQ ID NO:41 ; and wherein the genetic variant of human 
dihydrofolate reductase is an explanatory variable. 

45. The method of Claim 44 wherein said determining is performed by a method 
selected from the group consisting of PCR, special PCR, RT PCR, RFLP analysis, 

20 SSCP, and FISH. 

46. The method of Claim 1 wherein said analyzing the nucleic acids and/or 
proteins from the biological sample comprises determining if the biological sample 
contains the genetic variant of human dihydrofolate reductase having a nucleotide 
sequence with a 19 base-pair deletion spanning nucleotides 540 to 558 of the 

25 nucleotide sequence of SEQ ID NO:41 ; and wherein the genetic variant of human 
dihydrofolate reductase is an explanatory variable. 
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47. The method of Claim 46 wherein said determining is performed by a method 
selected from the group consisting of PCR, special PCR, RT PCR, RFLP analysis, 
SSCP, and FISH. 
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47. 



The method of Claim 46 wherein said determining is performed by a method 
selected from the group consisting ofPCR, special PGR, RT PGR, RFLP analysis, 



SSCP, and FISH. 
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Primers for PCR Anplif ication the 
Forward primer: 5'-CEA 
Reverse primer: 5'-AAA 



DHFR Deletion Polymorphism Region 
AAC TGC ATC GTC GGT CTG-3' 
AGG GGA ATC CAG TOG G-3' 



Figure 1 



WO 00/71754 PCT/USOO/14354 

1/5 



Primers for PCR Amplif ication the 
Forward primer: 5 '-CIA 

Reverse primer: 5 '-AAA 



DHFR Deletion Polyinarphism Region 
AAC 1GC ATC CTC GCT CTG-3' 
AGG GGA ATC CAG TOG G-3' 



Figure 1 
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Genotypes of the DHFR 19 bp Deletion 
by Ncn-denaturing Polyacrylamide Gel Electrophoresis 



CO 




Figure 2 
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Genotypes of the DHFR 19 bp Deletion 
by Non-denaturing Polyacrylamide Gel Electrophoresis 



CL 

CO 
CM 



C7> 




Figure 2 
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Sequences of PCR Amplification Products 
in the Region of the DHFR Deletion Polymorphism Region 

* 

Allele 1 GCTGCCCACGCT^^ 

llllllllllllllllll lliillililllllllil 
Allele 2 GCIX30CCAaX3TOQQQCT. GQCOGACI«XX3GOGAGA 



Figure 3 
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Sequences of PCR Aitplif icat ion Products 
in the Region of the DHFR Deletion Polymorphism Region 

* 

Allele 1 GCIGCCCACXXnra^ 

111111111111111111 IMIIIIIIMIMIIII 
Allele 2 GCIGCCCAOGGTOGGGCT GGCOGACTOOOGGOGAGA 



Figure 3 
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1 CTGCAGCGCC AGGGTCCACC TGGTCGGCTG CACCTGTGGA GGAGGAGGTG, 
51 GATTTCAGGC TTCCCGTAGA CTGGAAGAAT CGGCTCAAAA CCGCTTGCCT 
101 CGCAGGGGCT GAGCTGGAGG CAGCGAGGCC GCCCGACGCA GGCTTCCGGC 
151 GAGACATGGC AGGGCAAGGA TGGCAGCCCG GCGGCAGGGC CCGGCGAGGA 
201 GCGCGAACCC GCGGCCGCAG TTCCCAGGCG TCTGCGGGCG CGAGCACGCC 
251 GCGACCCTGC GTGCGCCGGG GCGGGGGGGC GGGGCCTCGC CTGCACAAAT 
301 AGGGACGAGG GGGCGGGGCG GCCACAATTT CGCGCCAAAC TTGACCGCGC 
351 GTTCTGCTGT AACGAGCGGG CTCGGAGGTC CTCCCGCTGC TGTCATGGTT 
401 GGTTCGCTAA ACTGCATCGT CGCTGTGTCC CAGAACATGG GCATCGGCAA* 
451 GAACGGGGAC CTGCCCTGGC CACCGCTCAG GTATCTGCCG GGCCGGGGCG 
501 ATGGGACCCA AACGGGCGCA GGCTGCCCAC GGTCGGGGTA CCT-GGGCGGG 
551 ACGCGCCAGG CCGACTCCCG GCGAGAGGAT GGGGCCAGAC TTGCGGTCTG 
601 CGCTGGCAQG AAGGGTGGGC CCGACTGGAT TCCCCTTTTC TGCTGCGCGG 
651 GAGGCCCAGT TGCTGATTTC TGCCCGGATT CTGCTGCCCG GTGAGGTCTT 
701 TGCCCTGCGG CGCCCTCGCC CAGGGCAAAG TCCCAGCCCT GGAGAAAACA 
751 CCTCACCCCT ACCCACAGCG CTCCGTTTGT CAGGTGCCTT AGAGCTCGAG 
801 CCCAAGGGAT AATGTTTCGA GTAAGGCTGT TTCTCTAACT TGTAGGAATG 
851 AATTCAGATA TTTCCAGAGA ATGACCACAA CCTCTTCAGT AGAAGGTAAT 
901 GTGGGATTAA GTAGGGTCTT GCTTGATGAA GTTTACCAGT GCAAATGTTA 
951 GTTAAATGGA AAGTTTTCCG TGTTAATCTG GGACCTTTTC TCTTATTATG 
1001 GATCTGTATG ATCTGTATGC AGTTCCCAAG GTTCATTTAC CAT TAT T AAA 
1051 AAATTTTTGT CTTAGAAATT TTATGTATGT CAACGCACGA GCAAATTATC 
1101 AGGCATGGGG CAGAATTGGC AACTGGGTGG AGGCTTCGGT GGAGGTTAGC 
1151 ACTCGGAAAG GAAAACAGAG TAGGCCTTTG GAACAGCTGC TGGAAGAGAT 
1201 AAGGCCTGAA CAAGGGCAGT GGAGAAGAGA GGGTAAAAAT TTTTTAAGGT 
1251 TACATGACCC TGGATTTTGG AGATC 



Figure 4A 
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1 CTGCAGCGCC AGGGTCCACC TGGTCGGCTG CACCTGTGGA GGAGGAGGTG 
51 GATTTCAGGC TTCCCGTAGA CTGGAAGAAT. CGGCTCAAAA CCGCTTGCCT 
101 CGCAGGGGCT GAGCTGGAGG CAGCGAGGCC GCCCGACGCA GGCT.TCCGGC 
151 GAGACATGGC AGGGCAAGGA TGGCAGCCCG GCGGCAGGGC CGGGCGAGGA 
201 GCGCGAACCC GCGGCCGCAG. TTCCCAGGCG TCTGCGGGCG CGAGCACGCC 
251 GCGACCCTGC GTGCGCCGGG GCGGGGGGGG GGGGCCTCGC CTGCACAAAT 
301 AGGGACGAGG GGGCGGGGCG GCCACAATTT CGCGCCAAAC TTGACCGCGC 
351 GTTCTGCTGT AACGAGCGGG CTCGGAGGTG CTCCCGCTGC TGTCATGGTT 
4 01 GGTTCGCTAA ACTGCATCGT CGCTGTGTCC CAGAACATGG GCATCGGCAA 
451 GAACGGGGAC CTGCCCTGGC CACCGCTCAG GTATCTGCCG GGCCGGGGCG 
501 ATGGGACCCA AACGGGCGCA GGCTGCCCAC GGTCGGGGTA CCTGGGCGGG 
551 ACGCGCCAGG CCGACTCCCG GCGAGAGGAT GGGGCCAGAC TTGCGGTCTG 
601 CGCTGGCAGG AAGGGTGGGC CCGACTGGAT TCCCCTTTTC TGCTGCGCGG 
651 GAGGCCCAGT TGCTGATTTC TGCCCGGATT CTGCTGCCCG GTGAGGTCTT 
701 TGCCCTGCGG CGCCCTCGCC CAGGGCAAAG TCCCAGCCCT GGAGAAAACA 
751 CCTCACCCCT ACCCACAGCG CTCCGTTTGT CAGGTGCGTT AGAGCTCGAG 
801 CCCAAGGGAT AATGTTTCGA GTAACGCTGT TTCTCTAACT TGTAGGAATG 
851 AATTCAGATA TTTCCAGAGA ATGACCACAA CCTCTTCAGT AGAAGGTAAT 
901 GTGGGATTAA GTAGGGTCTT GCTTGATGAA GTTTACCAGT GCAAATGTTA 
951 GTTAAATGGA AAGTTTTCCG TGTTAATCTG GGACCTTTTC TCTTATTATG 
1001 GATCTGTATG ATCTGTATGC AGTTCCCAAG GTTCATTTAC CAT TAT T AAA 
1051 AAATTTTTGT CTTAGAAATT TTATGTATGT CAACGCACGA GCAAATTATC 
1101 AGGCATGGGG CAGAATTGGC AACTGGGTGG AGGCTTCGGT GGAGGTTAGC 
1151 ACTCCGAAAG GAAAACAGAG ■ TAGGCCTTTG GAACAGCTGC TGGAAGAGAT 
1201 AAGGCCTGAA CAAGGGCAGT GGAGAAGAGA GGGTAAAAAT TTTTTAAGGT 
1251 TACATGACCC TGGATTTTGG AGATC 



Figure 4A 
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1 CTGCAGCGCC AGGGTCCACC TGGTCGGCTG CACCTGTGGA GGAGGAGGTG 
51 GATTTCAGGC TTCCCGTAGA CTGGAAGAAT CGGCTCAAAA CCGCTTGCCT 
101 CGCAGGGGCT GAGCTGGAGG CAGCGAGGCC GCCCGACGCA GGCTTCCGGC 
151 GAGACATGGC AGGGCAAGGA TGGCAGCCCG GCGGCAGGGC CCGGCGAGGA 
201 GCGCGAACCC GCGGCCGCAG TTCCCAGGCG TCTGCGGGCG CGAGCACGCC 
251 GCGACCCTGC GTGCGCCGGG GCGGGGGGGC GGGGCCTCGC CTGCACAAAT 
301 AGGGACGAGG GGGCGGGGCG GCCACAATTT CGCGCCAAAC TTGACCGCGC 
3 51 GTTCTGCTGT AACGAGCGGG CTCGGAGGTC CTCCCGCTGC TGTCATGGTT 
401 GGTTCGCTAA ACTGCATCGT CGCTGTGTCC CAGAACATGG GCATCGGCAA 
451 GAACGGGGAC CTGCCCTGGC CACCGCTCAG GTATCTGCGG GGCCGGGGCG 
501 ATGGGACCCA AACGGGCGCA GGCTGCCCAC GGTCGGGGT 
551 GG CCGACTCCCG GCGAGAGGAT GGGGCCAGAC TTGCGGTCTG 

601 CGCTGGCAGG AAGGGTGGGC CCGACTGGAT TCCCCTTTTC TGCTGCGCGG 
651 GAGGCCCAGT TGCTGATTTC TGCCCGGATT CTGCTGCCCG GTGAGGTCTT 
701 TGCCCTGCGG CGCCCTCGCC CAGGGCAAAG TCCCAGCCCT GGAGAAAACA 
751 CCTCACCCCT ACCCACAGCG CTCCGTTTGT- CAGGTGCCTT AGAGCTCGAG 
801 CCCAAGGGAT AATGTTTCGA -GTAACGCTGT TTCTCTAACT TGTAGGAATG 
851 AATTCAGATA TTTCCAGAGA ATGACCACAA CCTCTTCAGT AGAAGGTAAT 
901 GTGGGATTAA GTAGGGTCTT GCTTGATGAA GTTTACCAGT GCAAATGTTA 
951 GTTAAATGGA AAGTTTTCCG TGTTAATCTG GGACCTTTTC TCTTATTATG 
1001 GATCTGTATG ATCTGTATGC AGTTCCCAAG GTTCATTTAC CAT TAT T AAA 
1051 AAATTTTTGT CTTAGAAATT TTATGTATGT ■ CAACGCACGA GCAAATTATC 
11 Oi AGGCATGGGG CAGAATTGGC AACTGGGTGG AGGCTtCGGT ' GGAGGTTAGC 
11 51 ACTCCGAAAG GAAAACAGAG TAGGCCTTTG GAACAGCTGC TGGAAGAGAT 
1201 AAGGCCTGAA CAAGGGCAGT GGAGAAGAGA GGGTAAAAAT TTTTTAAGGT 
1251 TACATGACCC TGGATTTTGG AGATC 



Figure 4B 
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1 CTGCAGCGCC AGGGTCCACC 
51 GATTTCAGGC TTCCCGTAGA 
101 CGCAGGGGCT GA3CTGGAGG 
151 GAGACATGGC AGGGCAAGGA 
201 GCGCGAACGC GCGGCCGCAG 
251 GCGACCCTGC GTGCGCCGGG 
301 AGGGACGAGG GGGCGGGGCG 
351 GTTCTGCTGT AACGAGCGGG 
401 GGTTCGCTAA ACTGCATCGT 
451 GAACGGGGAC CTGCCCTGGC 
501 ATGGGACCCA AACGGGCGCA 
551 GG CCGACTCCCG 

601 CGCTGGCAGG AAGGGTGGGC 
651 GAGGCCCAGT TGCTGATTTC 
701 TGCCCTGCGG CGCCCTCGCC 
751 CCTCACCCCT ACCCACAGCG 
8 01 CCCAAGGGAT AATGTTTCGA 
851 AATTCAGATA TTTCCAGAGA 
901 GTGGGATTAA GTAGGGTCTT 
951 GTTAAATGGA AAGTTTTCCG 
1001 GATCTGTATG ATCTGTATGC 
1051 AAATTTTTGT CTTAGAAATT 
1101 AGGCATGGGG CAGAATTGGC 
1151 ACTCCGAAAG GAAAACAGAG 
1201 AAGGCCTGAA CAAGGGCAGT 
1251 TACATGACCC TGGATTTTGG 



TGGTCGGCTG CACCTGTGGA GGAGGAGGTG 
CTGGAAGAAT CGGCTCAAAA CCGCTTGCCT 
CAGCGAGGCC- GCCCGACGCA GGCTTCCGGC* 
TGGCAGCCCG GCGGCAGGGC CCGGCGAGGA." 
TTCCCAGGCG TCTGCGGGCG CGAGCACGCC 
GCGGGGGGGC GGGGCCTCGC CTGCACAAAT 
GCCACAATTT CGCGCCAAAC TTGACCGCGC . 
CTCGGAGGTC CTCCCGCTGC TGTCATGGTT ; 
CGCTGTGTCC CAGAACATGG GCATCGGCAA 
CACCGCTCAG GTATCTGCCG GGCCGGGGCG 
GGCTGCCCAC GGTCGGGGT 
GCGAGAGGAT GGGGCCAGAC TTGCGGTCTG 
CCGACTGGAT TCCCCTTTTC TGCTGCGCGG 
TGCCCGGATT CTGCTGCCCG GTGAGGTCTT 
CAGGGCAAAG TCCCAGCCCT GGAGAAAACA 
CTCCGTTTGT- CAGGTGCCTT AGAGCTCGAG 
GTAACGCTGT TTCTCTAACT TGTAGGAATG 
ATGACCACAA CCTCTTCAGT AGAAGGTAAT 
GCTTGATGAA GTTTACCAGT GCAAATGTTA 
TGTTAATCTG GGACCTTTTC TCTTATTATG 
AGTTCCCAAG GTTCATTTAC CAT TAT T AAA 
TTATGTATGT CAACGCACGA GCAAATTATC 
AACTGGGTGG AGGCTTCGGT GGAGGTTAGC 
TAGGCCTTTG GAACAGCTGC TGGAAGAGAT. 
GGAGAAGAGA GGGTAAAAAT TTTTTAAGGT 
AGATC 



Figure 4B 



WO 00/71754 



PCT/USOO/14354 



SEQUENCE LISTING 

<110> Johnson, William G. 
Stenroos, Edward S. 

<120> METHODS FOR DIAGNOSING, PREVENTING , AND TREATING 
DEVELOPMENTAL DISORDERS 

<130> 601-1-057PCT 

<140> UNAS SIGNED 
<141> 2000-05-24 

<150> UNAS SIGNED 
<151> 2000-05-23 

<150> 60/136,198 
<151> 1999-05-25 

<160> 46 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2187 
<212> DNA 

<213> Homo sapiens 
<400> 1 

gccatggtga acgaagccag aggaaacagc agcctcaacc cctgcttgga gggcagtgcc 60 
agcagtggca gtgagagctc caaagatagt tcgagatgtt ccaccccggg cctggaccct 120 
gagcggcatg agagactccg ggagaagatg aggcggcgat tggaatctgg tgacaagtgg 180 
ttctccctgg aattcttccc tcctcgaact gctgagggag ctgtcaatct catctcaagg 240 
tttgaccgga tggcagcagg tggccccctc tacatagacg cgacctggca cccagcaggt 300 
gaccctggct cagacaagga gacctcctcc atgatgatcg ccagcaccgc cgtgaactac 3 60 
tgtggcctgg agaccatcct gcacatgacc tgctgccgtc agcgcctgga ggagatcacg 420 
ggccatctgc acaaagctaa gcagctgggc ctgaagaaca tcatggcgct gcggggagac 480 
ccaataggtg accagtggga agaggaggag ggaggcttca actacgcagt ggacctggtg 540 
aagcacatcc gaagtgagtt tggtgactac tttgacatct gtgtggcagg ttaccccaaa 600 
ggccaccccg aagcagggag ctttgaggct gacctgaagc acttgaagga gaaggtgtct 660 
gcgggagccg atttcatcat cacgcagctt ttctttgagg ctgacacatt cttccgcttt 720 
gtgaaggcat gcaccgacat gggcatcact tgccccatcg tccccgggat ctttcccatc 780 
cagggctacc actcccttcg gcagcttgtg aagctgtcca agctggaggt gccacaggag 840 
atcaaggacg tgattgagcc aatcaaagac aacgatgctg ccatccgcaa ctatggcatc 900 
gagctggccg tgagcctgtg ccaggagctt ctggccagtg gcttggtgcc aggcctccac 960 
ttctacaccc tcaaccgcga gatggctacc acagaggtgc tgaagcgcct ggggatgtgg 1020 
actgaggacc ccaggcgtcc cctaccctgg gctctcagtg cccaccccaa gcgccgagag 1080 
gaagatgtac gtcccatctt ctgggcctcc agaccaaaga gttacatcta ccgtacccag 1140 
gagtgggacg agttccctaa cggccgctgg ggcaattcct cttcccctgc ctttggggag 1200 
ctgaaggact actacctctt ctacctgaag agcaagtccc ccaa'ggagga gctgctgaag 12 60 
atgtgggggg aggagctgac cagtgaagca agtgtctttg aagtctttgt tctttacctc 1320 



I 



WO 00/71754 



PCT/USOO/14354 



SEQUENCE LISTING 

<110> Johnson, William G. 
Stenroos, Edward S. 

<120> METHODS FOR DIAGNOSING, PREVENTING , AND TREATING 
DEVELOPMENTAL DISORDERS 

<130> 601-1-057PCT 

<140> UNAS SIGNED 
<141> 2000-05-24 

<150> UNAS SIGNED 
<151> 2000-05-23 

<150> 60/136,198 
<151> 1999-05-25 

<160> 46 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2187 
<212> DNA 

<213> Homo sapiens 
<400> 1 

gccatggtga acgaagccag aggaaacagc agcctcaacc cctgcttgga gggcagtgcc 60 

agcagtggca gtgagagctc caaagatagt tcgagatgtt ccaccccggg cctggaccct 120 

gagcggcatg agagactccg ggagaagatg aggcggcgat tggaatctgg tgacaagtgg 180 

ttctccctgg aattcttccc tcctcgaact gctgagggag ctgtcaatct catctcaagg 240 

tttgaccgga tggcagcagg tggccccctc tacatagacg cgacctggca cccagcaggt 300 

gaccctggct cagacaagga gacctcctcc atgatgatcg ccagcaccgc cgtgaactac 360 

tgtggcctgg agaccatcct gcacatgacc tgctgccgtc agcgcctgga ggagatcacg 420 

ggccatctgc acaaagctaa gcagctgggc ctgaagaaca tcatggcgct gcggggagac 480 

ccaataggtg accagtggga agaggaggag ggaggcttca actacgcagt ggacctggtg 540 

aagcacatcc gaagtgagtt tggtgactac tttgacatct gtgtggcagg ttaccccaaa 600 

ggccaccccg aagcagggag ctttgaggct gacctgaagc acttgaagga gaaggtgtct 660 

gcgggagccg atttcatcat cacgcagctt ttctttgagg ctgacacatt cttccgcttt 720 

gtgaaggcat gcaccgacat gggcatcact tgccccatcg tccccgggat ctttcccatc 780 

cagggctacc actcccttcg gcagcttgtg aagctgtcca agctggaggt gccacaggag 840 

atcaaggacg tgattgagcc aatcaaagac aacgatgctg ccatccgcaa ctatggcatc 900 

gagctggccg tgagcctgtg ccaggagctt ctggccagtg gcttggtgcc aggcctccac 960 

ttctacaccc tcaaccgcga gatggctacc acagaggtgc tgaagcgcct ggggatgtgg 1020 

actgaggacc ccaggcgtcc cctaccctgg gctctcagtg cccaccccaa gcgccgagag 1080 

gaagatgtac gtcccatctt ctgggcctcc agaccaaaga gttacatcta ccgtacccag 1140 

gagtgggacg agttccctaa cggccgctgg ggcaattcct cttcccctgc ctttggggag 1200 

ctgaaggact actacctctt ctacctgaag agcaagtccc ccaaggagga gctgctgaag 1260 

atgtgggggg aggagctgac cagtgaagca agtgtctttg aagtctttgt tctttacctc 1320 
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tcgggagaac caaaccggaa tggtcacaaa 
ctggcggctg agaccagcct gctgaaggag 
ctcaccatca actcacagcc caacatcaac 
tggggcccca gcgggggcta tgtcttccag 
gagacagcgg aagcacttct gcaagtgctg 
cttgtcaatg tgaagggtga aaacatcacc 
acttggggca tcttccctgg gcgagagatc 
ttcatgttct ggaaggacga ggcctttgcc 
gaggaggagt ccccgtcccg caccatcatc 
aacctggtgg acaatgactt cccactggac 
ttggagcttc tcaacaggcc cacccagaat 
gtcctgacgc cctgcgttgg agccactcct 
ctcttgggaa ctccactctc cttcgtgtct 
acaatggcag ctagactgga gtgaggcttc 
catgggaacc tagtactctc tgctcta 



gtgacttgcc tgccctggaa cgatgagccc 1380 

gagctgctgc gggtgaaccg ccagggcatc 1440 

gggaagccgt cctccgaccc catcgtgggc 1500 

aaggcctact tagagttttt cacttcccgc 1560 

aagaagtacg agctccgggt taattaccac 1620 

aatgcccctg aactgcagcc gaatgctgtc 1680 

atccagccca ccgtagtgga tcccgtcagc 1740 

ctgtggattg agcggtgggg aaagctgtat 1800 

cagtacatcc acgacaacta cttcctggtc 1860 

aactgcctct ggcaggtggt ggaagacaca 1920 

gcgagagaaa cggaggctcc atgaccctgc 1980 

gtcccgcctt cctcctccac agtgctgctt 2040 

ctcccacccc ggcctccact cccccacctg 2100 

caggctcttc ctggacctga gtcggcccca. 2160 

.2187 



<210> 2 
<211> 7122 
<212> DNA 

<213> Homo sapiens 
<400> 2 

gcgcgtgtct ggctgctagg ccgacaccaa 
gtgctccagc agttgccgcg cccagccccg 
gggtccgcag tccccccgcg acgcgagcca 
gtggcaggct cgcctggcgc tggctggcgt 
acgtcttctc tgccgcgccc tctgcgcaag 
ccaagacctg tcgcaacccg aaggtctgaa 
tctgcagaag aggattatgg tgctggatgg 
gctaaacgaa gaacacttcc gaggtcagga 
caacaatgac attttaagta taactcagcc 
cttgctggct ggggcagata tcattgaaac 
agctgactat ggccttgaac acttggccta 
cagaaaagct gccgaggagg taactctcca 
tctgggtccg actaataaga cactctctgt 
gaacatcaca tttgatgagc ttgttgaagc 
tggcggggtt gatatcttac tcattgaaac 
cttgtttgca ctccaaaatc tttttgagga 
agggacgatc gttgataaaa gtgggcggac 
catcagcgtg tctcatggag aaccactcta 
tgaaatgaga ccttttattg aaataattgg 
tcccaatgca ggtcttccca acacctttgg 
caagcaccta aaggattttg ctatggatgg 
gtcaacacca gatcatatca gggaaattgc 
tccacctgcc actgcttttg aaggacatat 
tggaccgtac accaactttg ttaacattgg 
gtttgctaaa ctcatcatgg caggaaacta 
ggtggaaatg ggagcccagg tgttggatgt 
aagtgcaatg accagatttt gcaacttaat 
tttgtgcatc gactcctcca attttgctgt 
gaagtgcatt gtcaatagca ttagtctgaa 



ggactggccg ggtacccggg aagaaagcac 60 • 
agagaggccc tagggcgctg cgggctttcg 120 
acgggaggcg tcaaaagacc cgggccttgt 180 
ggcccttggc cgtcgtcacc tgtggagagc 240 
gaggagactc gacaacatgt cacccgcgct 300 
gaaaaccctg cgggatgaga tcaatgccat 360 
agggatgggg accatgatcc agcgggagaa 420 
atttaaagat catgccaggc cgctgaaagg 480 
tgatgtcatt taccaaatcc ataaggaata 540 
aaatactttt agcagcacta gtattgccca 600 
ccggatgaac atgtgctctg caggagtggc 660 
gacaggaatt aagaggtttg tggcaggggc 720 
gtccccatct gtggaaaggc cggattatag 780 
ataccaagag caggccaaag gacttctgga 840 
tatttttgat actgccaatg ccaaggcagc 900 
gaaatatgct ccccggccta tctttatttc 960 
tctttccgga cagacaggag agggatttgt 1020 
cattggatta aattgtgctt tgggtgcagc 1080 
aaaatgtaca acagcctatg tcctctgtta 1140 
tgactatgat gaaacgcctt ctatgatggc 1200 
cttggtcaat atagttggag gatgctgtgg 1260 
tgaagctgtg aaaaattgta agcctagagt 1320 
gttactgtct ggtctagagc ccttcaggat 1380 
agagcgctgt aatgttgcag gatcaaggaa 1440 
tgaagaagcc ttgtgtgttg ccaaagtgca 1500 
caacatggat gatggcatgc tagatggtcc 1560 
tgcttccgag ccagacatcg caaaggtacc 1620 
gattgaagct gggttaaagt gctgccaagg 1680 
ggaaggagag gacgacttct tggagaaggc 1740 
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tcgggagaac caaaccggaa tggtcacaaa 
ctggcggctg agaccagcct gctgaaggag 
ctcaccatca actcacagcc caacatcaac 
tggggcccca gcgggggcta tgtcttccag 
gagacagcgg aagcacttct gcaagtgctg 
cttgtcaatg tgaagggtga aaacatcacc 
acttggggca tcttccctgg gcgagagatc 
ttcatgttct ggaaggacga ggcctttgcc 
gaggaggagt ccccgtcccg caccatcatc 
aacctggtgg acaatgactt cccactggac 
ttggagcttc tcaacaggcc cacccagaat 
gtcctgacgc cctgcgttgg agccactcct 
ctcttgggaa ctccactctc cttcgtgtct 
acaatggcag ctagactgga gtgaggcttc 
catgggaacc tagtactctc tgctcta 



gtgacttgcc tgccctggaa cgatgagccc 1380 
gagctgctgc gggtgaaccg ccagggcatc 1440 
gggaagccgt cctccgaccc catcgtgggc 1500 
aaggcctact tagagttttt cacttcccgc 1560 
aagaagtacg agctccgggt taattaccac 1620 
aatgcccctg aactgcagcc gaatgctgtc 1680 
atccagccca ccgtagtgga tcccgtcagc 1740 
ctgtggattg agcggtgggg aaagctgtat 1800 
cagtacatcc acgacaacta cttcctggtc 1860 
aactgcctct ggcaggtggt ggaagacaca 1920 
gcgagagaaa cggaggctcc atgaccctgc 1980 
gtcccgcctt cctcctccac agtgctgctt 2040 
ctcccacccc ggcctccact cccccacctg 2100 
caggctcttc ctggacctga gtcggcccca 2160 

2187 



<210> 2 

<211> 7122 

<212> DNA 

<213> Homo sapiens 

<400> 2 

gcgcgtgtct ggctgctagg ccgacaccaa ggactggccg ggtacccggg aagaaagcac 60 
gtgctccagc agttgccgcg cccagccccg agagaggccc tagggcgctg cgggctttcg 120 
gggtccgcag tccccccgcg acgcgagcca acgggaggcg tcaaaagacc cgggccttgt 180 
gtggcaggct cgcctggcgc tggctggcgt ggcccttggc cgtcgtcacc tgtggagagc 240 
acgtcttctc tgccgcgccc tctgcgcaag gaggagactc gacaacatgt cacccgcgct 300 
ccaagacctg tcgcaacccg aaggtctgaa gaaaaccctg cgggatgaga tcaatgccat 3 60 
tctgcagaag aggattatgg tgctggatgg agggatgggg accatgatcc agcgggagaa 420 
gctaaacgaa gaacacttcc gaggtcagga atttaaagat catgccaggc cgctgaaagg 480 
caacaatgac attttaagta taactcagcc tgatgtcatt taccaaatcc ataaggaata 540 
cttgctggct ggggcagata tcattgaaac aaatactttt agcagcacta gtattgccca 600 
agctgactat ggccttgaac acttggccta, ccggatgaac atgtgctctg caggagtggc 660 
cagaaaagct gccgaggagg taactctcca gacaggaatt aagaggtttg tggcaggggc 720 
tctgggtccg actaataaga cactctctgt gtccccatct gtggaaaggc cggattatag 780 
gaacatcaca tttgatgagc ttgttgaagc ataccaagag caggccaaag gacttctgga 840 
tggcggggtt gatatcttac tcattgaaac tatttttgat actgccaatg ccaaggcagc 900 
cttgtttgca ctccaaaatc tttttgagga gaaatatgct ccccggccta tctttatttc 960 
agggacgatc gttgataaaa gtgggcggac tctttccgga cagacaggag agggatttgt 1020 
catcagcgtg tctcatggag aaccactcta cattggatta aattgtgctt tgggtgcagc 1080 
tgaaatgaga ccttttattg aaataattgg aaaatgtaca acagcctatg tcctctgtta 1140 
tcccaatgca ggtcttccca acacctttgg tgactatgat gaaacgcctt ctatgatggc 1200 
caagcaccta aaggattttg ctatggatgg cttggtcaat atagttggag gatgctgtgg 1260 
gtcaacacca gatcatatca gggaaattgc tgaagctgtg aaaaattgta agcctagagt 1320 
tccacctgcc actgcttttg aaggacatat gttactgtct ggtctagagc ccttcaggat 1380 
tggaccgtac accaactttg ttaacattgg agagcgctgt aatgttgcag gatcaaggaa 1440 
gtttgctaaa ctcatcatgg caggaaacta tgaagaagcc ttgtgtgttg ccaaagtgca 1500 
ggtggaaatg ggagcccagg tgttggatgt caacatggat gatggcatgc tagatggtcc 1560 
aagtgcaatg accagatttt gcaacttaat tgcttccgag ccagacatcg caaaggtacc 1620 
tttgtgcatc gactcctcca attttgctgt gattgaagct gggttaaagt gctgccaagg 1680 
gaagtgcatt gtcaatagca ttagtctgaa ggaaggagag gacgacttct tggagaaggc 1740 
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caggaagatt aaaaagtatg gagctgctat 
ggcaacagaa acagacacaa aaatcagagt 
aaaactgggc tttaatccaa atgacattat 
tggaatggag gaacacaact tgtatgccat 
agaaacatta cctggagcca gaataagtgg 
aggaatggaa gccattcgag aagcaatgca 
tggcatggac atggggatag tgaatgctgg 
ggaacttctg cagctctgtg aagatctcat 
gctcttacgt tatgcccaga ctcaaggcac 
gtggagaaat ggccctgtcg aagaacgcct 
acatattatt gaggatactg aggaagccag 
caatataatt gaaggacccc tgatgaatgg 
tggaaaaatg tttctacctc aggttataaa 
ccaccttatc cctttcatgg aaaaagaaag 
agaagaagag gacccttacc agggcaccat 
cgacataggc aagaacatag ttggagtagt 
tttaggagtc atgactccat gtgataagat 
tataattggc ctgtcaggac tcatcactcc 
ggaaatggag agattagcta taaggattcc 
aacccacaca gcagttaaaa tagctccgag 
cgcgtccaag agtgtggtgg tgtgttccca 
ctttgaggaa atcatggaag aatatgaaga 
ggagaggaga tacttaccct taagtcaagc 
gtctgaacct cacccagtga agcccacgtt 
cctgcagaag ctggtggact acattgactg 
gggcaagtac ccgaatcgag gctttcccaa 
ggccaggaag gtctacgatg atgcccacaa 
actccgggcc cggggtgtgg ttgggttctg 
cctgtacgca gaggctgctg tgccccaggc 
aaggcaacag gctgagaagg actctgccag 
catcgctccc ttgcattctg gcatccgtga 
tggggtagaa gagctgagca aggcctatga 
ggtcaaggcg ctgggggacc ggctggcaga 
tcgccgagaa ctgtgggcct actgtggcag 
gctgcggtac aagggcatcc gcccggctcc 
gaagctcacc atgtggagac tcgcagacat 
atcattagca atggcacctg cttcagcagt 
caaatatttt gctgtgggga agatttccaa 
gaacatatct gtggctgagg ttgagaaatg 
ctaacttttt ttttttttgc cttttttatt 
ggtgccttaa aaataacaac aacaaaaaac 
tctggttttc gaagactatt tagtggaacc 
ctggaaaaca ggcgctgttt ttttgggacc 
tggtttccct ggtccctctg agatggggac 
agcaagtcaa cctgcttttt tctgttttta 
tttttttcct cttagaagaa aagcctgaaa 
acaaaatgat actgtgaaaa atggggcatt 
gacggggaag gtgtagctct gttctcttcg 
atggctgcag aactcccttt ggcaaaaggc 
tgaagccatt tgccccagtg tggtcaagca 
ataatttcat atttccgtac ccctgaggaa 



ggtggtcatg gcttttgatg aagaaggaca 1800 
gtgcacccgg gcctaccatc tgcttgtgaa 1860 
ttttgaccct aatatcctaa ccattgggac 1920 
taattttatc catgcaacaa aagtcattaa 1980 
aggtctttcc aacttgtcct tctccttccg 2040 
tggggttttc ctttaccatg caatcaagtc 2100 
aaacctccct gtgtatgatg atatccataa 2160 
ctggaataaa gaccctgagg ccactgagaa 2220 
aggagggaag aaagtcattc agactgatga 2280 
tgagtatgcc cttgtgaagg gcattgaaaa 2340 
gttaaaccaa aaaaaatatc cccgacctct 2400 
aatgaaaatt gttggtgatc tttttggagc 2460 
gtcagcccgg gttatgaaga aggctgttgg 2520 
agaagaaacc agagtgctta acggcacagt 2580 
cgtgctggcc actgttaaag gcgacgtgca 2640 
ccttggctgc aataatttcc gagttattga 2700 
actgaaagct gctcttgacc acaaagcaga 27 60 
ttccctggat gaaatgattt ttgttgccaa 2820 
attgttgatt ggaggagcaa ccacttcaaa 2880 
atacagtgca cctgtaatcc atgtcctgga 2940 
gctgttagat gaaaatctaa aggatgaata 3000 
tattagacag gaccattatg agtctctcaa 3060 
cagaaaaagt ggtttccaaa tggattggct 3120 
tattgggacc caggtctttg aagactatga 3180 
gaagcctttc tttgatgtct ggcagctccg. 3240 
gatatttaac gacaaaacag taggtggaga 3300 
tatgctgaac acactgatta gtcaaaagaa 33 60 
gccagcacag agtatccaag acgacattca 3420 
tgcagagccc atagccacct tctatgggtt 3480 
cacggagcca tactactgcc tctcagactt 3540 
ctacctgggc ctgtttgccg ttgcctgctt 3600 
ggatgatggt gacgactaca gcagcatcat 3 660 
ggcctttgca gaagagctcc atgaaagagt 3720 
tgagcagctg gacgtcgcag acctgcgcag 3780 
tggctacccc agccagcccg accacaccga 3840 
cgagcagtct acaggcatta ggttaacaga 3900 
ctcaggcctc tacttctcca atttgaagtc 3960 
ggatcaggtt gaggattatg cattgaggaa 4020 
gcttggaccc attttgggat atgatacaga 4080 
cttgatgatc ctcaaggaaa tacaacctag 4140 
ctgtgtgcat ctggctgaca cttacctgct 4200 
ttgtagagga gcagggtctt cctgcagtgc 4260 
ttgcgtgaag agcagtgagc agggttcctg 4320 
agactgaaga cagaggtcgt ttgatttcaa 4380 
cagtggaatc taggaggcca cttagtcgtc 4440 
ctgagttgaa tagagaagtg tgaccctgtg 4500 
ttaatctaag tggttataac agtggattct 4560 
gaagacctcg ttttctaaag gctggactaa 4620 
atgcgctcac tgcttgcttg tcagaaacac 4680 
gccatgcttt ctgggcattt tcgtcctccc 4740 
acaaaaagga aatgaggaga gaaagttact 4800 
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caggaagatt aaaaagtatg gagctgctat ggtggtcatg gcttttgatg aagaaggaca 1800 
ggcaacagaa acagacacaa aaatcagagt gtgcacccgg gcctaccatc tgcttgtgaa 1860 
aaaactgggc tttaatccaa atgacattat ttttgaccct aatatcctaa ccattgggac 1920 
tggaatggag gaacacaact tgtatgccat taattttatc catgcaacaa aagtcattaa 1980 
agaaacatta cctggagcca gaataagtgg aggtctttcc aacttgtcct tctccttccg 2040 
aggaatggaa gccattcgag aagcaatgca tggggttttc ctttaccatg caatcaagtc 2100 
tggcatggac atggggatag tgaatgctgg aaacctccct gtgtatgatg atatccataa 2160 
ggaacttctg cagctctgtg aagatctcat ctggaataaa gaccctgagg ccactgagaa 2220 
gctcttacgt tatgcccaga ctcaaggcac aggagggaag aaagtcattc agactgatga 2280 
gtggagaaat ggccctgtcg aagaacgcct tgagtatgcc cttgtgaagg gcattgaaaa 2340 
acatattatt gaggatactg aggaagccag gttaaaccaa aaaaaatatc cccgacctct 2400 
caatataatt gaaggacccc tgatgaatgg aatgaaaatt gttggtgatc tttttggagc 2460 
tggaaaaatg tttctacctc aggttataaa gtcagcccgg gttatgaaga aggctgttgg 2520 
ccaccttatc cctttcatgg aaaaagaaag agaagaaacc agagtgctta acggcacagt 2580 
agaagaagag gacccttacc agggcaccat cgtgctggcc actgttaaag gcgacgtgca 2640 
cgacataggc aagaacatag ttggagtagt ccttggctgc aataatttcc gagttattga 2700 
tttaggagtc atgactccat gtgataagat actgaaagct gctcttgacc acaaagcaga 2760 
tataattggc ctgtcaggac tcatcactcc ttccctggat gaaatgattt ttgttgccaa- 2820 
ggaaatggag agattagcta taaggattcc attgttgatt ggaggagcaa ccacttcaaa 2880 
aacccacaca gcagttaaaa tagctccgag atacagtgca cctgtaatcc atgtcctgga 2940 
cgcgtccaag agtgtggtgg tgtgttccca gctgttagat gaaaatctaa aggatgaata 3000 
ctttgaggaa atcatggaag aatatgaaga tattagacag gaccattatg agtctctcaa 3060 
ggagaggaga tacttaccct taagtcaagc cagaaaaagt ggtttccaaa tggattggct 3120 
gtctgaacct cacccagtga agcccacgtt tattgggacc caggtctttg aagactatga 3180 
cctgcagaag ctggtggact acattgactg gaagcctttc tttgatgtct ggcagctccg 3240 
gggcaagtac ccgaatcgag gctttcccaa . gatatttaac gacaaaacag taggtggaga 3300 
ggccaggaag gtctacgatg atgcccacaa tatgctgaac acactgatta gtcaaaagaa 3360 
actccgggcc cggggtgtgg ttgggttctg gccagcacag agtatccaag acgacattca 3420 
cctgtacgca gaggctgctg tgccccaggc tgcagagccc atagccacct tctatgggtt 3480 
aaggcaacag gctgagaagg actctgccag cacggagcca tactactgcc tctcagactt 3540 
catcgctccc ttgcattctg gcatccgtga ctacctgggc ctgtttgccg ttgcctgctt 3 600 
tggggtagaa gagctgagca aggcctatga ggatgatggt gacgactaca gcagcatcat 3660 
ggtcaaggcg ctgggggacc ggctggcaga ■ ggcctttgca gaagagctcc atgaaagagt 3720 
tcgccgagaa ctgtgggcct actgtggcag tgagcagctg gacgtcgcag acctgcgcag 3780 
gctgcggtac aagggcatcc gcccggctcc tggctacccc agccagcccg accacaccga 3840 
gaagctcacc atgtggagac tcgcagacat cgagcagtct acaggcatta ggttaacaga 3900 
atcattagca atggcacctg cttcagcagt ctcaggcctc tacttctcca atttgaagtc 3960 
caaatatttt gctgtgggga agatttccaa ggatcaggtt gaggattatg cattgaggaa 4020 
gaacatatct gtggctgagg ttgagaaatg gcttggaccc attttgggat atgatacaga 4080 
ctaacttttt ttttttttgc cttttttatt cttgatgatc ctcaaggaaa tacaacctag 4140 
ggtgccttaa aaataacaac aacaaaaaac ctgtgtgcat ctggctgaca cttacctgct 4200 
tctggttttc gaagactatt tagtggaacc ttgtagagga gcagggtctt cctgcagtgc 4260 
ctggaaaaca ggcgctgttt ttttgggacc ttgcgtgaag agcagtgagc agggttcctg 4320 
tggtttccct ggtccctctg agatggggac agactgaaga cagaggtcgt ttgatttcaa 4380 
agcaagtcaa cctgcttttt tctgttttta cagtggaatc taggaggcca cttagtcgtc 4440 
tttttttcct cttagaagaa aagcctgaaa ctgagttgaa tagagaagtg tgaccctgtg 4500 
acaaaatgat actgtgaaaa atggggcatt ttaatctaag tggttataac agtggattct 4 560 
gacggggaag gtgtagctct gttctcttcg gaagacctcg ttttctaaag gctggactaa 4620 
atggctgcag aactcccttt ggcaaaaggc atgcgctcac tgcttgcttg tcagaaacac 4680 
tgaagccatt tgccccagtg tggtcaagca gccatgcttt ctgggcattt tcgtcctccc 4740 
ataatttcat atttccgtac ccctgaggaa acaaaaagga aatgaggaga gaaagttact 4800 
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gttaagggtg gttaacattt tttttgtttt gttttgtttt ggtttttttt ttttgagaca 4860 
gagtctggct ctgtcgccca ggctggagtg caggggcgca atctcggctc atagcaagct 4920 
ccgcctcctg ggttcatgcc attctcctgc ctcagcctcc agagtagctg ggactacagg 4980 
tgcccaccac cacacccggc taattttttg tgtttttaca aaatacaaaa aagtagagac 5040 
aggatttcac tgtgttagcc aggatggtct tgatctcccg acctcgtgat ctgcccacct 5100 
cagcctccca aaatgctggg attacaggcg tgagccaccg agcctggccg gttaacatct 5160 
tttaattgtt tccaggattg agcaggttct cagctgggct ctgatatccc gtgcggagtt 5220 
ggacaagtgg gcagcataaa gtcactcatt tcttaccatt ttattcccct caattctcaa 5280 
tatattcagt aatgaagaat ggtgccacca ctcaagcaac aagcctcaaa ctcaaccatg 5340 
tcatcttttt cttggatgat tgcagttatt tcaaaaattt gcatgcaaaa tatacactca 5400 
tcctacttca agatggtggt ggcaatagtc aggagaaggt aacattggag tcctggtttg 5460 
attcgaagga tgaagacgaa gaagcaaggg aggaacaaat gaagaaccat ctttgttcat 5520 
gaataggaat attcaagatt ataaaggtat caggtctcct aaaattgatc tatggattta 5580 
ataccatttt caatggaaat tccaacagat tttattgaat gaaacaagca ggtgtttata 5640 
tggagtagca aaggacttaa aattaccaaa tgcttctaaa tatgaaggag aggttgggga 5700 
cacgcaccct atgtgatacc aagttttatt gtcaagacag tgtcatggtg cagaggtagg 57 60 
cattctgagc aggggaacaa aataagggcc tagaaactca cccgtgcata tgttgacctt 5820 
tgcaaaatga cctggtgaca tggcaagtca gtggggacag gaaggaccac tccctaagta 5880 
atcccagaac aatggctatt catgtgggaa aaaaagaaat tttactttct ctcaccttac 5940 
ctggtgataa gttccaaata tgttaagggc tttaatacaa aaagcaaaaa ttgtcagtgt 6000 
ttggatgaaa aaagccttag ggcaggaaag aatctcttga gacataaagt agtaatcata 6060 
aaggacaaga tggttaagtc aattctgtta aaactcaagg cttatattaa gcaaacactt 6120 
gaagtgagaa gatgatccac aacttgagaa gacatttata atacaaataa ctgatgaagg 6180 
attcataatc acaaatatag agaattccta tttaaaaaaa tagaaaaata gtgaagacta 6240 
cacaagagga aatagggctt ttaaataaat agatgttctg tagcattggt cagggaaata 6300 
tgaattagga ccacaatgag attccatttt atatccataa gatttgcaaa ggttgggtct 6360 
gacagtacca gttgttagat ctgtagggac ttgtacaaca ttgtggatgt gtaaacaggc 6420 
accactgctt taaaaaacaa ttatccctta cagacttgaa catttgcaga cgttatgatc 6480 
ttgcttccaa ctcccacctg tatgtccagc aaactcttgc atgtggccac taggaggaat 6540 
gtgtaagaat gttcatagtt acatatttat aatagttaat aactggaaaa agtgaaatgt 6600 
atgtctgtct acaggaaaat aggtgaataa ttagatatat atattcattc tacgggatat 6660 
tattcagtag tggaaatgag tgaactacag ctatacctca caataagaat gaatctcaga 6720 
aaatattaag gaaaaaagca agtttgaaga gaccacatgg ggcgtactat ttttattggg 6780 
cccaaaaaca agcaaaacca aagaatatgt agtctaagca tacgtataca ataaaactat 6840 
gctattaaaa aaaaaaggta actgataaac caaaattgag catagtaatt acccacagaa 6900 
ggaggaagtg gaagggacag gagcacatag gtagatgcca agttatgcag ctgttctggt 6960 
tcctcctggt aggcttacaa gtgtttacta tatgctatta atacattata ctttataact 7020 
aatagataac agttttttac atattaaata tgttctactt aaatatatta taaaaaataa 7080 
aggcaaagtg gaatgtttaa aaaaaaaaaa aaaaaaaaaa aa 7122 

<210> 3 

<211> 564 

<212> DNA 

<213> Homo sapiens 

<400> 3 

atggttggtt cgctaaactg catcgtcgct gtgtcccaga acatgggcat cggcaagaac 60 
ggggacctgc cctggccacc gctcaggaat gaattcagat atttccagag aatgaccaca 120 
acctcttcag tagaaggtaa acagaatctg gtgattatgg gtaagaagac ctggttctcc 180 
attcctgaga agaatcgacc tttaaagggt agaattaatt tagttctcag cagagaactc 240 
aaggaacctc cacaaggagc teattttctt tccagaagtc tagatgatgc cttaaaactt 300 
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gttaagggtg gttaacattt tttttgtttt gttttgtttt ggtttttttt ttttgagaca 4860 
gagtctggct ctgtcgccca ggctggagtg caggggcgca atctcggctc atagcaagct 4920 
ccgcctcctg ggttcatgcc attctcctgc ctcagcctcc agagtagctg ggactacagg 4980 
tgcccaccac cacacccggc taattttttg tgtttttaca aaatacaaaa aagtagagac 5040 
aggatttcac tgtgttagcc aggatggtct tgatctcccg acctcgtgat ctgcccacct 5100 
cagcctccca aaatgctggg attacaggcg tgagccaccg agcctggccg gttaacatct 5160 
tttaattgtt tccaggattg agcaggttct cagctgggct ctgatatccc gtgcggagtt 5220 
ggacaagtgg gcagcataaa gtcactcatt tcttaccatt ttattcccct caattctcaa 5280 
tatattcagt aatgaagaat ggtgccacca ctcaagcaac aagcctcaaa ctcaaccatg 5340 
tcatcttttt cttggatgat tgcagttatt tcaaaaattt gcatgcaaaa tatacactca 5400 
tcctacttca agatggtggt ggcaatagtc aggagaaggt aacattggag tcctggtttg .5460 
attcgaagga tgaagacgaa gaagcaaggg aggaacaaat gaagaaccat ctttgttcat 5520 
gaataggaat attcaagatt ataaaggtat caggtctcct aaaattgatc tatggattta 5580 
ataccatttt caatggaaat tccaacagat tttattgaat gaaacaagca ggtgtttata 5640 
tggagtagca aaggacttaa aattaccaaa tgcttctaaa tatgaaggag aggttgggga 5700 
cacgcaccct atgtgatacc aagttttatt gtcaagacag tgtcatggtg cagaggtagg 57 60 
cattctgagc aggggaacaa aataagggcc tagaaactca cccgtgcata tgttgacctt 5820 
tgcaaaatga cctggtgaca tggcaagtca gtggggacag gaaggaccac tccctaagta 5880 
atcccagaac aatggctatt catgtgggaa aaaaagaaat tttactttct ctcaccttac -5940 
ctggtgataa gttccaaata tgttaagggc tttaatacaa aaagcaaaaa ttgtcagtgt 6000 
ttggatgaaa aaagccttag ggcaggaaag- aatctcttga gacataaagt agtaatcata 6060- 
aaggacaaga tggttaagtc aattctgtta aaactcaagg cttatattaa gcaaacactt 6120 
gaagtgagaa gatgatccac aacttgagaa gacatttata atacaaataa ctgatgaagg 6180 
attcataatc acaaatatag agaattccta tttaaaaaaa tagaaaaata gtgaagacta 6240 
cacaagagga aatagggctt ttaaataaat agatgttctg tagcattggt cagggaaata 6300 
tgaattagga ccacaatgag attccatttt atatccataa gatttgcaaa ggttgggtct 6360 
gacagtacca gttgttagat ctgtagggac ttgtacaaca ttgtggatgt gtaaacaggc 6420 
accactgctt taaaaaacaa ttatccctta cagacttgaa catttgcaga cgttatgatc 6480 
ttgcttccaa ctcccacctg tatgtccagc aaactcttgc atgtggccac taggaggaat 6540 
gtgtaagaat gttcatagtt acatatttat aatagttaat aactggaaaa agtgaaatgt 6600 
atgtctgtct acaggaaaat aggtgaataa ttagatatat atattcattc tacgggatat 6660 
tattcagtag tggaaatgag tgaactacag ctatacctca caataagaat gaatctcaga 6720 
aaatattaag gaaaaaagca agtttgaaga gaccacatgg ggcgtactat ttttattggg 6780 
cccaaaaaca agcaaaacca aagaatatgt agtctaagca tacgtataca ataaaactat 6840 
gctattaaaa aaaaaaggta actgataaac caaaattgag catagtaatt acccacagaa 6900 
ggaggaagtg gaagggacag gagcacatag gtagatgcca agttatgcag ctgttctggt 6960 
tcctcctggt aggcttacaa gtgtttacta tatgctatta atacattata ctttataact 7020 
aatagataac agttttttac atattaaata tgttctactt aaatatatta taaaaaataa 7080 
aggcaaagtg gaatgtttaa aaaaaaaaaa aaaaaaaaaa aa 7122 

<210> 3 
<211> 564 
<212> DNA 

<213> Homo sapiens 
<400> 3 

atggttggtt cgctaaactg catcgtcgct gtgtcccaga acatgggcat cggcaagaac 60 
ggggacctgc cctggccacc gctcaggaat gaattcagat atttccagag aatgaccaca 120 
acctcttcag tagaaggtaa acagaatctg gtgattatgg gtaagaagac ctggttctcc 180 
attcctgaga agaatcgacc ' tttaaagggt agaattaatt tagttctcag cagagaactc 240 
aaggaacctc cacaaggagc tcattttctt tccagaagtc tagatgatgc cttaaaactt 300 
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actgaacaac cagaattagc aaataaagta 
gtttataagg aagccatgaa tcacccaggc 
caagactttg aaagtgacac gttttttcca 
ccagaatacc caggtgttct ctctgatgtc 
gaagtatatg agaagaatga ttaa 

<210> 4 

<211> 2158 

<212> DNA 

<213> Homo sapiens 

<400> 4 

gcgcggcata acgacccagg tcgcggcgcg 
ggagccgagc atggagtacc aggatgccgt 
cggctacctg gagcaggtga agcgccagcg 
ggaactgtac ctggcacgga gtgggctgca 
ccacgtcact gggacgaagg ggaagggetc 
aagctatggc ctgaagacgg gattctttag 
gatccgcatc aatgggcagc ccatcagtcc 
ctaccaccgg ctggaggaga ccaaggatgg 
cttcctgaca ctcatggcct tccacgtctt 
ggaggtgggc attggcgggg cttatgactg 
cggagtctcc tctcttggca tcgaccacac 
cgcatggcag aaagggggca tctttaagca 
tgaaggtccc ctggcagtgc tgagggaccg 
gtgtccgatg ctggaggccc tcgaggaagg 
ggagcaccag cggtccaacg ccgccttggc 
gcaggaccgc catggtgctg gggagccaaa 
gcccctggca cctgtgttcc agcccacatc 
gtggccgggc cggacgcagg tgctgcggcg 
gcacaccgcc agcagcgcgc aggcctgcgt 
cgagaggccg agcggtggcc ccgaggttcg 
ggacccggcg gccctgctga agctgctgca 
ccctaacctg acagaggtgt catccacagg 
actggaccag gtcctgctcc gctgcctgga 
agagcaggcc agcccggacc tctggagtgc 
cctgcttctg gcgccccacc caccccacac 
catttcacat gccttgcaat ggatcagcca 
tcccccaaag ggcctcctca cccaccctgt 
ggctgctgcc atccatgtgc tagtcactgg 
gctgctggag cccgcactgt cccagtagcc 
acacctgcct gcgttctccc catgaactta 
ctggttctgt ctagactggc ctaggggcca 
gtctttttta aggctctgtg ccttggtctc 
ctccccgggt ctctcactgt tgcagtggcc 
cgcctgcctc ctggctcagg cccagcttat 
ttgccatgtg ctgggtggta gatttcctcc 
tctgcctggg acactgcggg acagagggtg 

<210> 5 
<211> 7720 



gacatggtct ggatagttgg tggcagttct 3 60 
catcttaaac tatttgtgac aaggatcatg 420 
gaaattgatt tggagaaata taaacttctg 480 
caggaggaga aaggcattaa gtacaaattt 540 

564 



gcggggcttg agcgcgtggc cggtgccgca 60 

gcgcatgctc aataccctgc agaccaatgc 120 

gggtgaccct cagacacagt tggaagccat 180 

ggtggaggac ttggaccggc tgaacatcat 240 . 

cacctgtgcc ttcacggaat gtatcctccg 300 

ctctccccac. ctggtgcagg ttcgggagcg 360 

tgagctcttc accaagtact tctggcgcct 420 

cagctgtgtc tccatgcccc cctacttccg 480 

cctccaagag aaggtggacc tggcagtggt 540 

caccaacatc atcaggaagc ctgtggtgtg 600 

cagcctcctg ggggatacgg tggagaagat 660 

aggtgtccct gccttcactg tgctccaacc 720 

agcccagcag atctcatgtc ctctatacct 780 

ggggccgccg ctgaccctgg gcctggaggg 840 

cttgcagctg gcccactgct ggctgcagcg 900 

ggcatccagg ccagggctcc tgtggcagct 960 

ccacatgcgg ctcgggcttc ggaacacgga 1020 

cgggcccctc acctggtacc tggacggtgc 1080 

gcgctggttc cgccaggcgc tgcagggccg 1140 

agtcttgctc ttcaatgcta ccggggaccg 1200 

gccctgccag tttgactatg ccgtcttctg 1260 

caacgcagac caacagaact tcacagtgac 1320 

acaccagcag cactggaacc acctggacga 1380 

ccccagccca gagcccggtg ggtccgcatc 1440 

ctgcagtgcc agctccctcg tcttcagctg 1500 

aggccgagac cccatcttcc agccacctag 1560 

ggctcacagt ggggccagca tactccgtga 1620 

cagcctgcac ctggtgggtg gtgtcctgaa 1680 

aaggcccggg gttggaggtg ggagcttccc 1740 

catactaggt gccttttgtt tttggctttc 1800 

gggctttggg atgggaggcc gggagaggat 1860 

tccttcctct tggctgagat agcagagggg 1920 

tggccgttca gcctgtctcc cccaacaccc 1980 

tgtgtgcgct gcctggccag gccctgggtc 2040 

tcccagtgcc ttctgggaag ggagagggcc 2100 

gctggagtga attaaagcct ttgttttt 2158 
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actgaacaac cagaattagc aaataaagta gacatggtct ggatagttgg tggcagttct 360 
gtttataagg aagccatgaa tcacccaggc catcttaaac tatttgtgac aaggatcatg 420 
caagactttg aaagtgacac gttttttcca gaaattgatt tggagaaata taaacttctg 480 
ccagaatacc caggtgttct ctctgatgtc caggaggaga aaggcattaa gtacaaattt 540 
gaagtatatg agaagaatga ttaa 564 

<210> 4 

<211> 2158 

<212> DNA 

<213> Homo sapiens 

<400> 4 

gcgcggcata acgacccagg tcgcggcgcg gcggggcttg agcgcgtggc cggtgccgca 60 
ggagccgagc atggagtacc aggatgccgt gcgcatgctc aataccctgc agaccaatgc 120 
cggctacctg gagcaggtga agcgccagcg gggtgaccct cagacacagt tggaagccat 180 - 
ggaactgtac ctggcacgga gtgggctgca ggtggaggac ttggaccggc tgaacatcat 240 
ccacgtcact gggacgaagg ggaagggctc cacctgtgcc ttcacggaat gtatcctccg 300 
aagctatggc ctgaagacgg gattctttag ctctccccac ctggtgcagg ttcgggagcg 360 
gatccgcatc aatgggcagc ccatcagtcc tgagctcttc accaagtact tctggcgcct 420 
ctaccaccgg ctggaggaga ccaaggatgg cagctgtgtc tccatgcccc cctacttccg .480 
cttcctgaca ctcatggcct tccacgtctt cctccaagag aaggtggacc tggcagtggt 540 
ggaggtgggc attggcgggg cttatgactg caccaacatc atcaggaagc ctgtggtgtg 600 
cggagtctcc tctcttggca tcgaccacac cagcctcctg ggggatacgg tggagaagat 660 
cgcatggcag aaagggggca tctttaagca aggtgtccct gccttcactg tgctccaacc 720 
tgaaggtccc ctggcagtgc tgagggaccg agcccagcag atctcatgtc ctctatacct 780 
gtgtccgatg ctggaggccc tcgaggaagg ggggccgccg ctgaccctgg gcctggaggg 840 
ggagcaccag cggtccaacg ccgccttggc cttgcagctg gcccactgct ggctgcagcg 900 
gcaggaccgc catggtgctg gggagccaaa ggcatccagg ccagggctcc tgtggcagct 960 
gcccctggca cctgtgttcc agcccacatc ccacatgcgg ctcgggcttc ggaacacgga 1020 
gtggccgggc cggacgcagg tgctgcggcg cgggcccctc acctggtacc tggacggtgc 1080 
gcacaccgcc agcagcgcgc aggcctgcgt gcgctggttc cgccaggcgc tgcagggccg 1140 
cgagaggccg agcggtggcc ccgaggttcg agtcttgctc ttcaatgcta ccggggaccg 1200 
ggacccggcg gccctgctga agctgctgca gccctgccag tttgactatg ccgtcttctg 1260 
ccctaacctg acagaggtgt catccacagg caacgcagac caacagaact tcacagtgac 1320 
actggaccag gtcctgctcc gctgcctgga acaccagcag cactggaacc acctggacga 1380 
agagcaggcc agcccggacc tctggagtgc ccccagccca gagcccggtg ggtccgcatc 1440 
cctgcttctg gcgccccacc caccccacac ctgcagtgcc agctccctcg tcttcagctg 1500 
catttcacat gccttgcaat ggatcagcca aggccgagac cccatcttcc agccacctag 1560 
tcccccaaag ggcctcctca cccaccctgt ggctcacagt ggggccagca tactccgtga 1620 
ggctgctgcc atccatgtgc tagtcactgg cagcctgcac ctggtgggtg gtgtcctgaa 1680 
gctgctggag cccgcactgt cccagtagcc aaggcccggg gttggaggtg ggagcttccc 1740 
acacctgcct gcgttctccc catgaactta catactaggt gccttttgtt tttggctttc 1800 
ctggttctgt ctagactggc ctaggggcca gggctttggg atgggaggcc gggagaggat 1860 
gtctttttta aggctctgtg ccttggtctc tccttcctct tggctgagat agcagagggg 1920 
ctccccgggt ctctcactgt tgcagtggcc tggccgttca gcctgtctcc cccaacaccc 1980 
cgcctgcctc ctggctcagg cccagcttat tgtgtgcgct gcctggccag gccctgggtc 2040 
ttgccatgtg ctgggtggta gatttcctcc tcccagtgcc ttctgggaag ggagagggcc 2100 
tctgcctggg acactgcggg acagagggtg gctggagtga attaaagcct ttgttttt 2158 



<210> 5 
<211> 7720 
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<212> DNA 

<213> Homo sapiens 

<400> 5 

taagttgaca cttctcaggt tgtcacaaga 
agctgggatc tcctgggaat tggtctgctt 
gattttcctc tagagatcca actgtcttct 
cctttcttgt ggcattgcca gattactggg 
atagtctgat ggttcccaca tctgcatcca 
tacaaccacc ttccccgggc caagggcaca 
ttcttcctgc ccaaacctgc ccctccctct 
ctcagagaat aaaaaaaatg aaggcttctg 
gttagcccag aatcacaggt gctgtagaaa 
atgagttggg gatggaagga gagcttggcc 
ttcagaacat ctgtgatttt gtggctggtg 
ggagttggtg gctctacaat ttatggcctt 
gaggttggaa gcctgggctc tagcccttgc 
tgcctaacag gacagagggg ttggaaactg 
caggctgcaa gatttttttc agacctcagt 
agggactggc cttaggcctg cactgttaat 
ggctggtgct cacatacaat aattaactgc 
ctcctgggct ccattcccac tccctgcctg 
ctggaataag gcaaggggga gtgtagagca 
cctcctctcc caggtatgtg acactcccca 
attcccacca tgtgttaagg attttctgaa 
agagaatctt gaagtggaga ctgaggccca 
gacaagttag tgttcatccc cctgaactag 
tgagttctca gctcttctag tctggggccc 
gtggggaggg gtcacagctg gactaagacc 
gcttgcagag tgatgctgct caaaaggaca 
agcaagagag actactgact gaacaggtgg 
tcagccccca tccccagcac tgtgtgttgg 
ggtgcagggg gcaaaggcca aaagagctct 
acttggggca tggccctcat ctgtgctgaa 
tttgttgatt tcccccttct tacatttaat 
gtttgcttct ctttccccca aggccaagga 
ggttgaacgg gaaccctgtg ctctaaacag 
aaaggatcac ctggtattcc ctgagagtac 
tgagtgagca ggtccacagg ggcatgattg 
gagtgaatga acactggaat caatagagta 
gagctgctgg gtgggaattc aattccaggc 
gagacagccc agctcaggcc ctgcctagac 
ggaggggcag cacgggggca aggcaagctt 
actggtatgt ggaagatgag ttagaggaga 
ctgagcattt taggagggcc cagacacctg 
tcccaaaatt cagtgtctga ggtcaaagga 
agctctgtgg tagggggcac aggagctccc 
ccctgccagc acccatgtcc tgtgacccca 
ggcctactaa actagcttga gtgatgaggc 
gcaaaacaaa ctaacaaaaa ccacactgca 
tttttttttt ttttgagatg gagtctcgct 



ttcaggtatg gctcactgtt gcaggacata 60 
gcaggcccta gagagccttc cttcttggtt 120 
caggctcccc tgcctgcctc ctccttgggt 180 
cccccatttt ccctacactt actgccactc 240 
acctggactc ttcccctgag ctttcccctc 300 
caggcacctc gacaaaacag tgttctatgt 360 
cccttttccc atctgtggta ccaccatggg 420 
tcattgactg gggtggagat ggagggaaga 480 
ggatacctga gttgccggga gagggggtcc 540 
cttcaaacaa ttgaagatct gatcaaaaga 600 
atgggtgaca cctgggctaa tggggttggg 660 
gggagatcct tgctctctat agctgactgg 720 
cttgatcctc cggatctcat tttcctcatc 780 
atgagattag ctcaaaggat cctggcagct 840 
gtttgggaaa aaattgggta ggtggagctt 900 
tcaccccctc ccactacccc atggaggcct 960 
tgagtggcct tcgcccaatc ccaggctcca 1020 
tctcctaggc cactaaacca cagctgtccc 1080 
gagcagaagc ctgagccaga cggagagcca 1140 
tcccccttca gaggccacac accctatggc 1200 
ctggaagggc cctctgtttg cctgaaggcc 1260 
gaccagagtg tggcctgctc aagattaaac 1320 
tacctgggct ctagcccttc agtccagagc 1380 
caaggttggg tgtgggggtc atgattgttg 1440 
tgaaggtgag actaggcagg tgggaaagga 1500 
ggaagagagc ctggcttcag aagcagccac 1560 
gctccactgg gggctccgga aaggattttc 1620 
ccgcacccat gagagcctca gcactctgaa 1680 
ggcctgaact tgggtggtcc ctactgtgtg 1740 
atgattccac aaagattaaa ctggctatca 1800 
ccttgcagga gaaagctaag cctcaagata 1860 
gaaggtggag tgagggctgg ggtcgggaca 1920 
ttagggtttg ttcccgcagg aactgaaccc 1980 
agatttctcc ggcgtggccc tcaaggttag 2040 
gatcctggaa tgaatgaatc aaccatgaga 2100 
gcagagtaat ggattgtgga gcaggaaaga 2160 
ttatatgagc cctgctgtgc agtcggcctg 2220 
ccctgtcaag gaggccctgt caagaggaga 2280 
gtgagcggga aaggcatgtc cactttagcg 2340 
cagatggaga gaagtcatag gaaataaatt 2400 
gtgtccagtg gagtgaagga aacagtcgcc 24 60 
ttgaagttct gtgatgacca aggagaagcc 2520 
caaggcccca gggctgtcca gctggctgtc 2580 
ccccaccaag atcccatggt ttccgggaag 2640 
tagaaagggg ctgggaccaa ggtttaaaaa 2700 
gcccccccaa ctaaaacatt tttataaact 2760 
ctgtcaccca ggctagagtg caatggcaca 2820 
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<212> DNA 

<213> Homo sapiens 

<400> 5 

taagttgaca cttctcaggt tgtcacaaga ttcaggtatg gctcactgtt gcaggacata 60 
agctgggatc tcctgggaat tggtctgctt gcaggcccta gagagccttc cttcttggtt 120 
gattttcctc tagagatcca actgtcttct caggctcccc tgcctgcctc ctccttgggt 180 
cctttcttgt ggcattgcca gattactggg cccccatttt ccctacactt actgccactc 240 
atagtctgat ggttcccaca tctgcatcca acctggactc ttcccctgag ctttcccctc 300 
tacaaccacc ttccccgggc caagggcaca caggcacctc gacaaaacag tgttctatgt 360 
ttcttcctgc ccaaacctgc ccctccctct cccttttccc atctgtggta ccaccatggg 420 
ctcagagaat aaaaaaaatg aaggcttctg tcattgactg gggtggagat ggagggaaga 480 
gttagcccag aatcacaggt gctgtagaaa ggatacctga gttgccggga gagggggtcc 540 
atgagttggg gatggaagga gagcttggcc cttcaaacaa ttgaagatct gatcaaaaga 600 
ttcagaacat ctgtgatttt gtggctggtg atgggtgaca cctgggctaa tggggttggg 660 
ggagttggtg gctctacaat ttatggcctt gggagatcct tgctctctat agctgactgg 720 
gaggttggaa gcctgggctc tagcccttgc cttgatcctc cggatctcat tttcctcatc 780 
tgcctaacag gacagagggg ttggaaactg atgagattag ctcaaaggat cctggcagct 84 0 
caggctgcaa gatttttttc agacctcagt gtttgggaaa aaattgggta ggtggagctt 900 
agggactggc cttaggcctg cactgttaat tcaccccctc ccactacccc atggaggcct 960 
ggctggtgct cacatacaat aattaactgc tgagtggcct tcgcccaatc ccaggctcca 1020 
ctcctgggct ccattcccac tccctgcctg tctcctaggc cactaaacca cagctgtccc 1080 
ctggaataag gcaaggggga gtgtagagca gagcagaagc ctgagccaga cggagagcca 1140 
cctcctctcc caggtatgtg acactcccca tcccccttca gaggccacac accctatggc 1200 
attcccacca tgtgttaagg attttctgaa ctggaagggc cctctgtttg cctgaaggcc 1260 
agagaatctt gaagtggaga ctgaggccca gaccagagtg tggcctgctc aagattaaac 1320 
gacaagttag tgttcatccc cctgaactag tacctgggct ctagcccttc agtccagagc 1380 
tgagttctca gctcttctag tctggggccc caaggttggg tgtgggggtc atgattgttg 1440 
gtggggaggg gtcacagctg gactaagacc tgaaggtgag actaggcagg tgggaaagga 1500 
gcttgcagag tgatgctgct caaaaggaca ggaagagagc ctggcttcag aagcagccac 1560 
agcaagagag actactgact gaacaggtgg gctccactgg gggctccgga aaggattttc 1620 
tcagccccca tccccagcac tgtgtgttgg ccgcacccat gagagcctca gcactctgaa 1680 
ggtgcagggg gcaaaggcca aaagagctct ggcctgaact tgggtggtcc ctactgtgtg 1740 
acttggggca tggccctcat ctgtgctgaa atgattccac aaagattaaa ctggctatca 1800 
tttgttgatt tcccccttct tacatttaat ccttgcagga gaaagctaag cctcaagata 1860 
gtttgcttct ctttccccca aggccaagga gaaggtggag tgagggctgg ggtcgggaca 1920 
ggttgaacgg gaaccctgtg ctctaaacag ttagggtttg ttcccgcagg aactgaaccc 1980 
aaaggatcac ctggtattcc ctgagagtac agatttctcc ggcgtggccc tcaaggttag 2040 
tgagtgagca ggtccacagg ggcatgattg gatcctggaa tgaatgaatc aaccatgaga 2100 
gagtgaatga acactggaat caatagagta gcagagtaat ggattgtgga gcaggaaaga 2160 
gagctgctgg gtgggaattc aattccaggc ttatatgagc cctgctgtgc agtcggcctg 2220 
gagacagccc agctcaggcc ctgcctagac ccctgtcaag gaggccctgt caagaggaga 2280 
ggaggggcag cacgggggca aggcaagctt gtgagcggga aaggcatgtc cactttagcg 2340 
actggtatgt ggaagatgag ttagaggaga cagatggaga gaagtcatag gaaataaatt 2400 
ctgagcattt taggagggcc cagacacctg gtgtccagtg gagtgaagga aacagtcgcc 2460 
tcccaaaatt cagtgtctga ggtcaaagga ttgaagttct gtgatgacca aggagaagcc 2520 
agctctgtgg tagggggcac aggagctccc caaggcccca gggctgtcca gctggctgtc 2580 
ccctgccagc acccatgtcc tgtgacccca ccccaccaag atcccatggt ttccgggaag 2640 
ggcctactaa actagcttga gtgatgaggc tagaaagggg ctgggaccaa ggtttaaaaa 2700 
gcaaaacaaa ctaacaaaaa ccacactgca gcccccccaa ctaaaacatt tttataaact 2760 
tttttttttt ttttgagatg gagtctcgct ctgtcaccca ggctagagtg caatggcaca 2820 
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atcttggctc actgtaacct ccacctcctg 
cacgtagctg ggactacagg cacacgacac 
agacagggtt tcactatgtt ggccaggctg 
ccacctcagc cttccaaagt gctgggatta 
ttgtaaactt ttacaatgaa gtaatttggt 
tttatgtata gttttaattt atcccactag 
gattattggg tatatgaaaa aaatattttc 
attttaaccc tgaaacattt gaataaggca 
tctgcatttt gcccaaatcc atccttgaat 
gaaggcaatg aggctcaagc cagggagggg 
gggaaaactg agggagatgg gggcagggct 
accctcctgg agccctgcac acaacttaag 
accacagctc tttcttcagg gacagacatg 
cttctagtgt gggtggctgt agtaggggag 
gagcttctca atgtctgcat gaacgccaag 
aagttgcatg agcaggtggg ccagggggtg 
gaggaaacga ggacatggaa atgccaaacc 
gccctteagt ttgcattaat atgggtgact 
acaatgccaa cagttcacct tcttggttgt 
aggcccaaag gagagcctgg gaaatgaagt 
tagggattta gactgggaat gactcctcca 
catagtggcc tcttttctgc cagccctaaa 
tgaggctgtg tgcaaagcat tctttttttt 
ccaggctggt ctcaaattcc tggactcaag 
tgggattaca gaaatgagcc gtacgccctc 
aactttgggc tgtgtctctc gaccacattg 
gctaccaccc ctttaatatc ctgaacatga 
gtaatttgta ggccaggtgt tacggctcac 
gatgggcaga tcacttgagc tcaggagttc 
catctctact aaaaaataaa aaaaattagt 
gctactcagg aggctgaggt gggcaggtca 
gtgccactgc actccagcct gggcaacaga 
aaaaaaagaa agaaaggaag gaaggaaggg 
gaggaaaggg agggaggcaa gggagagaaa 
tgagatagag ttttgctctt gttgcccagg 
tgcaacctcc acctcccagg ttcaagtgat 
cgccaccaca cccagctaat tttttgtttg 
tagagatggg ggtttcacca tgttggccag 
gcccctcttg gcctcccaaa gtgctgagat 
agtgcacatt ttatttattt atttatttat 
gcccaggctg gagtgcagtg gcacaatctc 
aagcaattct tctgccttgg cctccagagt 
ctagctaatt tttgtatttt tagtagaaat 
ccattcttga ccttaagtga tctgtccacc 
actatgtgag ccactgtgcc ggcccacatt 
gagattcagg aagcttgagg ataggcacac 
tgcagccatc acaagttagt ttttaaggaa 
acttataata aaataacatc cacaattgat 
ttccacaaac agataatggg tgaggcagct 
gggattacag gtgtccgcca ccacacttgg 
ggagtcttgc tctgtcgccc aggctggagt 



gattcaagtg attctcctgc ctcagcctcc 2880 
cgcacccagc tcattttgta tttttagtag 2940 
gtctcaaact tctgacctca ggtgatccac 3000 
caggcatgag ccaccgcgcc cagcccattt 3060 
gtcaaaatct; gacctgaaaa ttaatgtgag 3120 
tgtaactgtt tcaccccaga atatacactt 3180 
tttgaatcac ctttgatgaa atcctaaaaa 3240 
ttgt.ggacct atggcaaact cctggctatt 3300 
tatatcacct gaacctcgtg accacctgga 3360 
tggtgtctaa tcctaccttt cattggatct 3420 
ctatctgccc caggcttccg tccaggcccc 3480 
gccccacctc cgcattcctt ggtgccactg 3540 
gctcagcgga tgacaacaca gctgctgctc 3600 
gctcagacaa ggattgcatg ggccaggact 3 660 
caccacaagg aaaagccagg ccccgaggac 3720 
atctggggtg gtgagggact ggctcaggaa 3780 
ccattggcac tggtgaactg aagtggagga, 3840 
tatttcagag acactgtgcc aaatgtcggt 3900 
tgagtttccg cattacagaa ataaggaagc 3960 
tggagtgacc catcctgggg ttgcttgatt 4020 
aagatctgag ggaagaaact gcacactgtg 4080 
cagctcaaga agggagagtc tctcacatta 4140 
ttttcctgag acaaagtctc catatgttgc 4200 
tgatcctccc acctcagccc tcccaaagtg 4260 
ctgaagcatc ttggttcatg catctcgcaa 4320 
gacctgaggt ctccctataa catttatttt 4380 
tgatataact aaagaaaaag cagaggaaaa 4440 
gcctgtaatc ccaacactgt gggatgtcga 4500 
gagaccagcc tgggcaagat ggcaaaaccc 4 560 
caggtgtggt ggcacatgcc tgcagtccca 4 620 
gttgagccca ggaggcagag attgtagatc 4 680 
gtgagacctt gtcaaaagaa agaaagaacg 4740 
gaggaaggaa agggagggag gaaagggagg 4800 
cttgtaatac gcatttcttt ttttttttct 4860 
gtggatggca gtggcacaat ctcagctcac 4 920 
tctcctgcct cagcctcctg agtaggcaca 4980 
tttgtttgtt ttgtttgttg gtatttttag 5040 
gctggtctcg aactcctcac ctcataatcc 5100 
tacaggtgtg agccactgcg cccggcctta 5160 
ttatttattg agatggagtc ttgctctgtt 5220 
agctcactgc aacctccacc tcccaggttc 5280 
agctgggact ataggcacct gccaccatgc 5340 
ggggttttgc catgttggcc aggctggtct 5400 
tccacctccc aaagtgctgg gattacaggc 5460 
ttaatattta gcttgtcagc cttaagtaat 5520 
aggagcatag tttcaagttg tcctgaattt 5580 
aaagattagt tcctaagttg tttctcaata 5640 
tggctataca ttgttttttt gtatcacaaa 5700 
agtcagggac aaaacacttc ccaagtagct 5760 
ctagtttttt gtttgtttat tttttgagat 5820 
gcagtggcat gatctcggct cactgcaagc 5880 



7 



WO 00/71754 



PCT/US00/14354 



atcttggctc actgtaacct ccacctcctg 
cacgtagctg ggactacagg cacacgacac 
agacagggtt tcactatgtt ggccaggctg 
ccacctcagc cttccaaagt gctgggatta 
ttgtaaactt ttacaatgaa gtaatttggt 
tttatgtata gttttaattt atcccactag 
gattattggg tatatgaaaa aaatattttc 
attttaaccc tgaaacattt gaataaggca 
tctgcatttt gcccaaatcc atccttgaat 
gaaggcaatg aggctcaagc cagggagggg 
gggaaaactg agggagatgg gggcagggct 
accctcctgg agccctgcac acaacttaag 
accacagctc tttcttcagg gacagacatg 
cttctagtgt gggtggctgt agtaggggag 
gagcttctca atgtctgcat gaacgccaag 
aagttgcatg agcaggtggg ccagggggtg 
gaggaaacga ggacatggaa atgccaaacc 
gcccttcagt ttgcattaat atgggtgact 
acaatgccaa cagttcacct tcttggttgt 
aggcccaaag gagagcctgg gaaatgaagt 
tagggattta gactgggaat gactcctcca 
catagtggcc tcttttctgc cagccctaaa 
tgaggctgtg tgcaaagcat tctttttttt 
ccaggctggt ctcaaattcc tggactcaag 
tgggattaca gaaatgagcc gtacgccctc 
aactttgggc tgtgtctctc gaccacattg 
gctaccaccc ctttaatatc ctgaacatga 
gtaatttgta ggccaggtgt tacggctcac 
gatgggcaga tcacttgagc tcaggagttc 
catctctact aaaaaataaa aaaaattagt 
gctactcagg aggctgaggt gggcaggtca 
gtgccactgc actccagcct gggcaacaga 
aaaaaaagaa agaaaggaag gaaggaaggg 
gaggaaaggg agggaggcaa gggagagaaa 
tgagatagag ttttgctctt gttgcccagg 
tgcaacctcc acctcccagg ttcaagtgat 
cgccaccaca cccagctaat tttttgtttg 
tagagatggg ggtttcacca tgttggccag 
gcccctcttg gcctcccaaa gtgctgagat 
agtgcacatt ttatttattt atttatttat 
gcccaggctg gagtgcagtg gcacaatctc 
aagcaattct tctgccttgg cctccagagt 
ctagctaatt tttgtatttt tagtagaaat 
ccattcttga ccttaagtga tctgtccacc 
actatgtgag ccactgtgcc ggcccacatt 
gagattcagg aagcttgagg ataggcacac 
tgcagccatc acaagttagt ttttaaggaa 
acttataata aaataacatc cacaattgat 
ttccacaaac agataatggg tgaggcagct 
gggattacag gtgtccgcca ccacacttgg 
ggagtcttgc tctgtcgccc aggctggagt 



gattcaagtg attctcctgc ctcagcctcc 2880 
cgcacccagc tcattttgta tttttagtag 2940 
gtctcaaact tctgacctca ggtgatccac 3000 
caggcatgag ccaccgcgcc cagcccattt 3060 
gtcaaaatct gacctgaaaa ttaatgtgag 3120 
tgtaactgtt tcaccccaga atatacactt 3180 
tttgaatcac ctttgatgaa atcctaaaaa 3240 
ttgtggacct atggcaaact cctggctatt 3300 
tatatcacct gaacctcgtg accacctgga- 3360 
tggtgtctaa tcctaccttt cattggatct 3420 
ctatctgccc caggcttccg tccaggcccc 3480 
gccccacctc cgcattcctt ggtgccactg f 3540 
gctcagcgga tgacaacaca gctgctgctc 3600 
gctcagacaa ggattgcatg ggccaggact 3660 
caccacaagg aaaagccagg ccccgaggac 3720 
atctggggtg gtgagggact ggctcaggaa 3780 
ccattggcac tggtgaactg aagtggagga 3840 
tatttcagag acactgtgcc aaatgtcggt 3900 
tgagtttccg cattacagaa ataaggaagc 3960 
tggagtgacc catcctgggg ttgcttgatt 4020 
aagatctgag ggaagaaact gcacactgtg 4080 
cagctcaaga agggagagtc tctcacatta 4140 
ttttcctgag acaaagtctc catatgttgc 4200 
tgatcctccc acctcagccc tcccaaagtg 4260 
ctgaagcatc ttggttcatg catctcgcaa 4320 
gacctgaggt ctccctataa catttatttt 4380 
tgatataact aaagaaaaag cagaggaaaa 4440 
gcctgtaatc ccaacactgt gggatgtcga 4500 
gagaccagcc tgggcaagat ggcaaaaccc 4 560 
caggtgtggt ggcacatgcc tgcagtccca 4 620 
gttgagccca ggaggcagag attgtagatc 4680 
gtgagacctt gtcaaaagaa agaaagaacg 4740 
gaggaaggaa agggagggag gaaagggagg 4800 
cttgtaatac gcatttcttt ttttttttct 4860 
gtggatggca gtggcacaat ctcagctcac 4920 
tctcctgcct cagcctcctg agtaggcaca 4980 
tttgtttgtt ttgtttgttg gtatttttag 5040 
gctggtctcg aactcctcac ctcataatcc 5100 
tacaggtgtg agccactgcg cccggcctta 5160 
ttatttattg agatggagtc ttgctctgtt 5220 
agctcactgc aacctccacc tcccaggttc 5280 
agctgggact ataggcacct gccaccatgc 5340 
ggggttttgc catgttggcc aggctggtct 5400 
tccacctccc aaagtgctgg gattacaggc 5460 
ttaatattta gcttgtcagc cttaagtaat 5520 
aggagcatag tttcaagttg tcctgaattt 5580 
aaagattagt tcctaagttg tttctcaata 5640 
tggctataca ttgttttttt gtatcacaaa 5700 
agtcagggac aaaacacttc ccaagtagct 5760 
ctagtttttt gtttgtttat tttttgagat 5820 
gcagtggcat gatctcggct cactgcaagc 5880 
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tccacctgcc gggttcacac cattctcctg 
gtgccagcca ccacgcccgg ctaatttttt 
gttggccagg atggtcttga tctcttagcc 
tgctgggatt acaggcgtga gccaccgcac 
cggggtttca ccatgttggc caggctggtc 
ccttggcctc ccaaagtgct gggattacac 
caatttaaat tgctttttta ccttcaaatc 
gcactattac actacaagtc cccatccgtc 
aggtgatgtg ttcattgttg taaccccagt 
aggatctcaa taaaggttga attaacaaat 
cccttttggg ctcccactgt ctctgacggc 
tattttcctg ggtgacttcc agtgggctgg 
atgtggaata ttctggctgt gctggctgtg 
ccatccagtg tcgaccctgg aggaagaatg 
cccataagga tgtttcctac ctatatagat 
ctgcctgcaa acggcatttc atccaggaca 
ggccctggat ccagcaggta tgcatggctt 
gagctttcca ggcatctctg caggctgcaa 
tgctgggatt cttgaacctg agcccttctt 
ctggcgcaaa gagcgggtac tgaacgtgcc 
ggaagattgt cgcacctcct acacctgcaa 
ttcaggtgag ggctggggtg ggcaggaatg 
tggaacaggt atgtgacaat ttggagttgt 
cccagtggct aaaggtcttc cctcctctct 
gcctgccaac ctttccattt ctacttcccc 
actcactcct acaaggtcag caactacagc 
ttcgacccag cccagggcaa ccccaatgag 
agtggggctg ggccctgggc agcctggcct 
tggctgctca gctgacctcc ttttaccttc 
cccacagctc ccaactattt ggttcctgct 
ataaaccaga caccgcacat gtgtcttgag 



cctcagcctc ccaagtagct gggactacag 594G 
gtatttttag tagagacggg gtttcaccat 6000 
tcgtgatcca cccgcctcgg cctcccaaaa 6060 
ccggcctaat ttttatattt ttagtagaga 6120 
tcaaactctt gatctcaggt gatccacctg 6180 
aagtaagcca ctgcacccag cctggggtta 6240 
tttgacacct cagtgaggct taatctgacc 6300 
tctgcttaat ttttgtccaa agcaaaaatc 6360 
ttctacaaaa gtacctgggt gagagtaagt 6420 
tttgtaatga ctgcaactcc agcaggagct 6480, 
cctctcccct aaagaggtcc caatagcaag .654 0 
ggaatcaagg actaagaggg gagacactgc 6600 
ctggctgtgg actgagtcct ctgtcttccc 6660 
cctgctgttc taccaacacc agccaggaag 6720 
tcaactggaa ccactgtgga gagatggcac 6780 
cctgcctcta cgagtgctcc cccaacttgg 6840 
cctgcaggta caagacctag cggagcagct 6900 
ccccagctcc agttctattc ggggctgagt 6960 
ttgtatcaaa atcacccagg tggatcagag 7020 
cctgtgcaaa gaggactgtg agcaatggtg 7080 
gagcaactgg cacaagggct ggaactggac 7140. 
gagggatttg gaagtggagg tgtgtgggtg 7200 
agggctggca gacctcaaga tagttccggg 72 60 
acagggttta acaagtgcgc agtgggagct 7320 
acacccactg ttctgtgcaa tgaaatctgg 7380 
cgagggagtg gccgctgcat ccagatgtgg 7440 
gaggtggcga ggttctatgc tgcagccatg 7500 
ttcctgctta gcctggccct aatgctgctg 7560 
tgatacctgg aaatccctgc cctgttcagc 7620 
ccatggtcgg gcctctgaca gccactttga 7 680 
aattatttgg 7720 



<210> 6 
<211> 255 
<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Val Trp Lys Trp Met Pro Leu 
1 5 

Thr Met Cys Ser Ala Gin Asp Arg 
20 

Asp Ala Lys His His Lys Thr Lys 
35 40 

Asp Gin Cys Ser Pro Trp Lys Lys 
50 55 

Ser Gin Glu Leu His Lys Asp Thr 



Leu Leu Leu Leu Val Cys Val Ala 
10 15 

Thr Asp Leu Leu Asn Val Cys Met 
25 30 

Pro Gly Pro Glu Asp Lys Leu His 
45 

Asn Ala Cys Cys Thr Ala Ser Thr 
60 

Ser Arg Leu Tyr Asn Phe Asn Trp 
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tccacctgcc gggttcacac cattctcctg 
gtgccagcca ccacgcccgg ctaatttttt 
gttggccagg atggtcttga tctcttagcc 
tgctgggatt acaggcgtga gccaccgcac 
cggggtttca ccatgttggc caggctggtc 
ccttggcctc ccaaagtgct gggattacac 
caatttaaat tgctttttta ccttcaaatc 
gcactattac actacaagtc cccatccgtc 
aggtgatgtg ttcattgttg taaccccagt 
aggatctcaa taaaggttga attaacaaat 
cccttttggg ctcccactgt ctctgacggc 
tattttcctg ggtgacttcc agtgggctgg 
atgtggaata ttctggctgt gctggctgtg 
ccatccagtg tcgaccctgg aggaagaatg 
cccataagga tgtttcctac ctatatagat 
ctgcctgcaa acggcatttc atccaggaca 
ggccctggat ccagcaggta tgcatggctt 
gagctttcca ggcatctctg caggctgcaa 
tgctgggatt cttgaacctg agcccttctt 
ctggcgcaaa gagcgggtac tgaacgtgcc 
ggaagattgt cgcacctcct acacctgcaa 
ttcaggtgag ggctggggtg ggcaggaatg 
tggaacaggt atgtgacaat ttggagttgt 
cccagtggct aaaggtcttc cctcctctct 
gcctgccaac ctttccattt ctacttcccc 
actcactcct acaaggtcag caactacagc 
ttcgacccag cccagggcaa ccccaatgag 
agtggggctg ggccctgggc agcctggcct 
tggctgctca gctgacctcc ttttaccttc 
cccacagctc ccaactattt ggttcctgct 
ataaaccaga caccgcacat gtgtcttgag 



cctcagcctc ccaagtagct gggactacag 5940 
gtatttttag tagagacggg gtttcaccat 6000 
tcgtgatcca cccgcctcgg cctcccaaaa 6060 
ccggcctaat ttttatattt ttagtagaga 6120 
tcaaactctt gatctcaggt gatccacctg 6180 
aagtaagcca ctgcacccag cctggggtta 6240 
tttgacacct cagtgaggct taatctgacc 6300 
tctgcttaat ttttgtccaa agcaaaaatc 6360 
ttctacaaaa gtacctgggt gagagtaagt 6420 
tttgtaatga ctgcaactcc agcaggagct 6480 
cctctcccct aaagaggtcc caatagcaag 6540 
ggaatcaagg actaagaggg gagacactgc 6600 
ctggctgtgg actgagtcct ctgtcttccc 6660 
cctgctgttc taccaacacc agccaggaag 6720 
tcaactggaa ccactgtgga gagatggcac 6780 
cctgcctcta cgagtgctcc cccaacttgg 6840 
cctgcaggta caagacctag cggagcagct 6900 
ccccagctcc agttctattc ggggctgagt 6960 
ttgtatcaaa atcacccagg tggatcagag 7020 
cctgtgcaaa gaggactgtg agcaatggtg 7080 
gagcaactgg cacaagggct ggaactggac 7140 
gagggatttg gaagtggagg tgtgtgggtg 7200 
agggctggca gacctcaaga tagttccggg 72 60 
acagggttta acaagtgcgc agtgggagct 7320 
acacccactg ttctgtgcaa tgaaatctgg 73 80 
cgagggagtg gccgctgcat ccagatgtgg 7440 
gaggtggcga ggttctatgc tgcagccatg 7500 
ttcctgctta gcctggccct aatgctgctg 7560 
tgatacctgg aaatccctgc cctgttcagc 7620 
ccatggtcgg gcctctgaca gccactttga 7680 
aattatttgg * 7720 



<210> 6 

<211> 255 

<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Val Trp Lys Trp Met Pro Leu Leu Leu Leu Leu Val Cys Val Ala 
1 5 10 15 

Thr Met Cys Ser Ala Gin Asp Arg Thr Asp Leu Leu Asn Val Cys Met 
20 25 30 



Asp Ala Lys His His Lys Thr Lys Pro Gly Pro Glu Asp Lys Leu His 
35 40 45 



Asp Gin Cys Ser Pro Trp Lys Lys Asn Ala Cys Cys Thr Ala Ser Thr 
50 55 60 



Ser Gin Glu Leu His Lys Asp Thr Ser Arg Leu Tyr Asn Phe Asn Trp 
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65 70 75 80 

Asp His Cys Gly Lys Met Glu Pro Ala Cys Lys Arg His Phe lie Gin 
85 90 95 

Asp Thr Cys Leu Tyr Glu Cys Ser Pro Asn Leu Gly Pro Trp He Gin 
100 105 110 

Gin Val Asn Gin Thr Trp Arg Lys Glu Arg Phe Leu Asp Val Pro Leu 
115 120 * 125 

Cys Lys Glu Asp Cys Gin Arg Trp Trp Glu Asp Cys His Thr Ser His 
130 135 140 

Thr Cys Lys Ser Asn Trp His Arg Gly Trp Asp Trp Thr Ser Gly Val 
145 150 155 160 

Asn Lys Cys Pro Ala Gly Ala Leu Cys Arg Thr Phe Glu Ser Tyr Phe 
165 170 175 

Pro Thr Pro Ala Ala Leu Cys Glu Gly Leu Trp Ser His Ser Tyr Lys 
180 185 190 

Val Ser Asn Tyr Ser Arg Gly Ser Gly Arg Cys He Gin Met Trp Phe 
195 200 205 

Asp Ser Ala Gin Gly Asn Pro Asn Glu Glu Val Ala Arg Phe Tyr Ala 
210 215 220 

Ala Ala Met His Val Asn Ala Gly Glu Met Leu His Gly Thr Gly Gly 
225 230 235 240 

Leu Leu Leu Ser Leu Ala Leu Met Leu Gin Leu Trp Leu Leu Gly 
245 250 255 



<210> 7 

<211> 817 

<212> DNA 

<213> Homo sapiens 

<400> 7 

cgcaggaata gatggacatg gcctggcaga 
ctgctgcggg gagtgcccag cccaggagtg 
gcatgaacgc caagcaccac aagacacagc 
gcagtccctg gaagaagaat gcctgctgca 
acacctcccg cctgtacaac tttaactggg 
agcgccactt tatccaggac agctgtctct 
cggcaggtca accagagctg gcgcaaagag 
gactgtgagc gctggtggga ggactgtcgc 
aaaggctgga attggacctc agggattaat 



tgatgcagct gctgcttctg gctttggtga 60 
cgcgggccag gacggacctg ctcaatgtct 120 
ccagccccga- ggacgagctg tatggccagt 180 
cggccagcac cagccaggag ctgcacaagg 240 
atcactgtgg taagatggaa cccacctgca 300 
gagtgctcac ccaacctggg gccctggatc 360 
cgcattctga acgtgcccct gtgcaaagag 420 
acctcctaca cctgcaaaag caactggcac 480 
gagtgtccgg ccggggccct ctgcagcacc 540 
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65 



70 



75 



80 



Asp His Cys Gly Lys Met Glu Pro Ala Cys Lys Arg His Phe lie Gin 
85 90 95 

Asp Thr Cys Leu Tyr Glu Cys Ser Pro Asn Leu Gly Pro Trp He Gin 
100 105 110 

Gin Val Asn Gin Thr Trp Arg Lys Glu Arg Phe Leu Asp Val Pro Leu 
115 120 125 

Cys Lys Glu Asp Cys Gin Arg Trp Trp Glu Asp Cys His Thr Ser His 
130 135 140 

Thr Cys Lys Ser Asn Trp His Arg Gly Trp Asp Trp Thr Ser Gly Val 
145 150 155 160 

Asn Lys Cys Pro Ala Gly Ala Leu Cys Arg Thr Phe Glu Ser Tyr Phe 
165 170 175 

Pro Thr Pro Ala Ala Leu Cys Glu Gly Leu Trp Ser His Ser Tyr Lys 
180 185 190 

Val Ser Asn Tyr Ser Arg Gly Ser Gly Arg Cys He Gin Met Trp Phe 
195 200 205 

Asp Ser Ala Gin Gly Asn Pro Asn Glu Glu Val Ala Arg Phe Tyr Ala 
210 215 220 

Ala Ala Met His Val Asn Ala Gly Glu Met Leu His Gly Thr Gly Gly* 
225 230 235 240 



Leu Leu Leu Ser Leu Ala Leu Met Leu Gin Leu Trp Leu Leu Gly 
245 250 255 



<210> 7 

<211> 817 

<212> DNA 

<213> Homo sapiens 

<400> 7 

cgcaggaata gatggacatg gcctggcaga 
ctgctgcggg gagtgcccag cccaggagtg 
gcatgaacgc caagcaccac aagacacagc 
gcagtccctg gaagaagaat gcctgctgca 
acacctcccg cctgtacaac tttaactggg 
agcgccactt tatccaggac agctgtctct 
cggcaggtca accagagctg gcgcaaagag 
gactgtgagc gctggtggga ggactgtcgc 
aaaggctgga attggacctc agggattaat 



tgatgcagct gctgcttctg gctttggtga 60 
cgcgggccag gacggacctg ctcaatgtct 120 
ccagccccga ggacgagctg tatggccagt 180 
cggccagcac cagccaggag ctgcacaagg 240 
atcactgtgg taagatggaa cccacctgca 300 
gagtgctcac ccaacctggg gccctggatc 360 
cgcattctga acgtgcccct gtgcaaagag 420 
acctcctaca cctgcaaaag caactggcac 480 
gagtgtccgg ccggggccct ctgcagcacc 540 
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aag SS Sa^g ^ tCt ^ ccactccttc 600 

cagggcaacc ccaatgagga SSJSSS ttcS V agat ^« tgactcagcc 660 
ccgtctcgtg ggattatga ?tSgatcc aSaaaaS T?**" ^^ggggcc 720 
cctattctaa tagacaaatc cacatgaaaa tSUS^ ctctggggtt cttccaacaa 780 



<210> 8 

<211> 1669 

<212> DNA 

<213> Homo sapiens 

<400> 8 



St^^^SS 2S2s SEES r gg9CCCa c ™ 60 

JSSS ~a cgg-gaS SgSggl SKfe Sj 
tactctgagg ggLcccggg £ I?™ 9 9CtCttgctt aaataacaaa 240 
gagaccctct gtcagaagc? agcc^gcaa a.tllr"^ Ctgagtttat tgatgaactg 300 
gtcaacgtcc agcccta^t? aggctcS? S ? tggacccaca gtgctggggg 360 
gaaccccatg ggcgcatcat gggcctggac cttcc^J^ Ctgtgtacac . ^ccctggtg 420 
tt G a t gacag acaagaagaa Ectctgcc acgtccatc? 9aCCCatggg 48 ° 

aaggtgaacc cagatactgg ctacatcaac ta^*- * Cttt 9 aatc . tatgccctac. 540 
ttccacccga agctgatca? egSgaacc tggaggagaa c ^gcctc 600 

cggctacgga agattgcaga tgagaSggg SgS'ctca laT^ 99aatatgcc W 
agcgggctgg tggcggctgg cgtggtgcS ?cS™S? tSScggacat ggctcacatc 720 
accaccactc acaagacc?? g?gagg?tgc aaCact9CCa tgtggtgacc 780 

gtgaaaagtg tggatcccaa gactggcala SS? tgatcttcta caggaaagga 840 

aattctgctg tgttccctgg LtgSa ^™? aCaacctg 9 a Btctcttatc 900 
gctgtggcac tgaagcaag? tatgaSSa ac "cgccat tgctggggtt 960 

gccaactgca ggg C ?ctg?c tgaggccctg acgSaoctS -ag gt g gtg 1020 

ggttctgaca accatttgat c«2t£a? J2£?tS 1?*™*^ a9tcaca ^ 1080 
gctgagaagg tgctagaagc ctgttctatt J ■ ! aaggcacaga tggtggaagg 1140 
agaagcgctc tgcggcccag tggactgcgg ctaaa^f agaacac:ct 9 tpcaggtgac 1200 
cttttggaaa aagacttcca aaaag^agJc cagcactgac gtcccgtgga 1260 

ctgcagatcc agagcgacac tggtgtcaga oS^ f aca ^gggat agagctgacc 1320 
gcagggg ata agtaccaggc ggStgcaa a ' 9 aa 9agttcaa ggagagactg 1380 
tctctcttcc ctctgcc?gg StgSS? ttctaa^ a " aggttga ^gcttcgcc 1440 
cctggcgcca cagaggaagc tgcctgcLcr J;"^^ 9 f S99CCCac tctggaccca 1500 
tccaaaggga .actgttgaca cLgggS? J££S£? = tga9agatg ^tgagctgc 1560 
gttttcttca caaa t caaaa ea StS £S££ agSSf """^ J™ 



<210> 9 
<211> 3112 
<212> DNA 
<213> Homo sapiens 

<400> 9 



SSSS SSS2 ££SS ~S S aa taaa9 g ~ « 

aagtcactca gttgaaggag caagtaccta ctJt^f aagggc 9 a 9 a ctgaaaaatc 120 
ttggcaacag agatgalLc aatctttata Sata^ aC f Ctggca "attacagg 180 

ccctcata taaatgtgaa gctgaaggct gctgaagaga 240 
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« t "StS SSS r fc 9t9aag 3CCtctggag ^ec tt c 6oo 

cagggcaacc ccaatjagga g^ggccaaa tSl^T tgactcagcc 660 

ccgtctcgtg ggattat^ga t?cc?gatcc llaltJ, ? CggCCat ^ aa tgctggggcc 720 
cctattctaa tagacaaa?c caSSa^ CtCtggggtt cttccaacaa 780 

on 



<210> 8 
<211> 1669 
<212> DNA 

<213> Homo sapiens 
<400> 8 



2£g£ SEES ?f 953cc ™ SMS3 "- t 60 

gtttacaaca tcattaagaa ggaaaataa^ tZT tcaaagacag tgatgttgag 120 

tcggagaatt tcgccagLg agcag^ttp ofaac^?" " ggattgga ^tgactgcc 180 
tactctgagg ggtacc?ggg cSagaS tiling * g « Cttgctt aaataacaaa 240 
gagaccctct gtcagaagS aacS^o tat W c 39S* .ctgagtttat tgatgaactg 300 
gteMcgtce agccJta^tc aggctcccc? gcaaStta gtgCtgg9 ^ 360 

gaaccccatg ggcgcatcat gggcctggac ctSSS* Ctgtgtacac cgccctggtg 420 
ttcatgacag acaagaagaa aatctclg" acgtcStc? SHf"^ g3CCCat ^- 480 . 
aaggtgaacc cagatactgg ctacatcaac IttntZ ^ctttgaatc tatgccctac 540 
ttccacccga agctgatcat cJcaSacc aa^T^ tggaggagaa cgcacgcctc 600 
cggctacgga agat?gcaga tgagaacggg SStctL ' CCgaaaCCt ^aatatgcc. 660 
agcgggctgg tggcggctgg. cgtggjgccc ^ccccattta z^T^ 9getCaCatc ™° 
accaccactc acaagaccct gcgagg^gc coa«?2£f aaCaCtgcca ^gtggtgacc 780 
gtgaaaagtg tggatcccaa gactgcaaa gaaatS^ tgatCttcta "ggaaagga 840 
aattctgctg tgttccctgg cctccaa 333 9 acaacctgga gtctcttatc 900 

Sctgtggcac tgaagcaage tatgact^a alltt^ aCCac9 " at tgctggggtt 960 
gccaactgca gggctctgtc tgaggccctg ^aaa"^ 39 tttatCaaca ccaggtggtg 1020 
ggttctgaca accatctgat ccttgtgga? otllalltll W agtCacag9t *>*° 

gctgagaagg tgctagaagc ctgttctatt iLf aa ggcacaga tggtggaagg U 40 
agaagcgctc tgeggcccag tggactgcgg SSa^ tcca ^gac. 1200 

cttttggaaa aagacttcca aafagtagcc 25?£S 2*°*^ 3 tc ^gt g ga 1260 
ctgcagatcc agagcgacac tggtatcaaa + aca g*gggat agagctgacc 1320 

gcaggggata agtaccaggc £££££ actctcf " aagagttCaa Srgagagactg 1380 
tctctcttcc ctctgcctgg cctgJSaac ?£? " 9 aggaggCt 9 a gagcttcgcc 1440 
cctggcgcca cagaggaagc tgcctgSa 2 **** gcg ^ cccac "tggaccca 1500 
tccaaaggga actgSflaL c?c2lc2? ^ Ctgaga ^tg gatgagctgc 1560 

g ttt tc tt ca caaatcLaa ^™ SSSSf ^ 



<210> 9 
<211> 3112 
<212> DNA 

<213> Homo sapiens 
<400> 9 



££££ SSEJS ££££ c^ 9 '" 

aagtcactca gttgaagSg caagtSS atltr^ aagggcga9a «»— *«tc 120 
ttggcaacag aga?ga?? c ? aatctStl SIS??* aCgCCtggca «*ttacaffg 180 

aacccttata taaatgtgaa gctgaaggct gctgaagaga 240 
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ttgggatcaa agccactcac attaagttac 
agtacattac atctttgaat gaagactcta 
tagattcaga gaattccatt aacactgaag 
atgtggatgg attgactagc atcaatgctg 
gtttcattcc ttgtacgcct aagggatgct 
ttgccggaag gcatgctgtg gtggttgggc 
acttgcttct gtggaacaat gccacagtga 
atgaggaggt aaataaaggt gacatcctgg 
aaggggagtg gatcaaacct ggggcaatag 
atgataaaaa accaaatggg agaaaagttg 
agagggcgag cttcatcact cctgttcctg 
tcatgcagag cacagtagag agtgccaagc 
ggatgattca gtataacaac cttaacctca 
cacgatcttg taaaccgaag cccattggta 
aagaggtaga attatatggt gaaacaaagg 
tgaagcaccg gcctgatggg aaatacgtgg 
gagaagggaa aagcacaact acaatcgggc 
agaatgtctt tgcgtgtgtg cgacagcctt 
gcgctgcagg aggcggctac tcccaggtca 
caggtgacat ccatgccatc actgcagcta 
ggatatttca tgaactgacc cagacagaca 
taaatggagt gagaaggttc tctgacatcc 
aaaagactga ccctaccaca ctgacagatg 
ttgatccaga aaccataact tggcaaagag 
^gatcacgat tggacaggct ccaacggaga 
tctctgtggc cagtgaaatt atggctgtcc 
gagagagact gggcaaaatg gtggtggcat 
aagatctggg ggtgagtggt gcactgacag 
tcatgcagac actggagggc actzccagtgt 
cacatggcaa ttcctccatc attgcagacc 
ggtttgtagt gacggaagca ggatttggag 
tcaaatgccg gtattccggc ctctgccccc 
ctctcaagat gcacgggggc ggccccacgg 
acatacagga gaacctggag ctggttgaaa 
aaaatgccag aatgtttgga attccagtag 
cagagtctga gctggacctc atcagccgcc 
tgaagtgcac tcactgggca gaagggggca 
agagagcagc acaagcaccc agcagcttcc 
aggataaaat caggatcatt gcacagaaga 
* ccgaagctca acacaaagct gaagtctaca 
gcatggctaa aacacacttg tctttgtctc 
gcttcattct gcccattcgc gacatccgcg 
tagtaggaac gatgagcaca atgcctggac 
atttggaccc tgaaacagaa caggtgaatg 
caagaagcta ctttgaaagt ctggccagtg 
ataagtaagc caagagaagt cagcccctgc 
tccccagaag tcattttcag ccttaattct 
tgtctgttta ctttagtgac gttccacaga 

<210> 10 
<211> 1792 



caagaacaac cacagaatct gaggtgatga 300 
ctgtacatgg gttcttagtg cagctacctt 360 
aagtgatcaa tgctattgca cccgagaagg 420 
ggagacttgc tagaggtgac ctcaatgact 480 
tggaactcat caaagagaca ggggtgccga 540 
gcagtaaaat agttggggcc ccgatgcatg 600 
ccacctgcca ctccaagact gcccatctgg 660 
tggttgcaac tggtcagcct gaaatggtta 720 
tcatcgactg tggaatcaat tatgtcccag 780 
tgggtgatgt ggcatacgac gaggccaaag 840 
gcggcgtagg gcccatgaca gttgcaatgc 900 
gtttcctgga gaaatttaag ccaggaaagt 960 
agacacctgt tccaagtgac attgatatat 1020 
agctggctcg agaaattggt ctgctgtctg 1080 
ccaaagttct gctgtcagca ctagaacgcc 1140 
tggtgactgg aataactcca acacccctgg 1200 
tagtgcaagc ccttggtgcc catctctacc 1260 
ctcagggccc cacctttgga ■ ataaaaggtg 1320 
ttcctatgga agagtttaat ctccacctca 1380 
ataacctcgt tgctgcggcc attgatgctc 1440 
aggctctctt taatcgtttg gtgccatcag 1500 
aaatccgaag gttaaagaga ctaggcattg 1560 
aagagataaa cagatttgca agattggaca 1620 
tgttggatac caatgataga ttcctgagga 1680 
agggtcacac acggacggcc cagtttgata 1740 
tggctctcac cacttctcta gaagacatga 1800 
ccagtaagaa aggagagccc gtcagtgccg 1860 
tgcttatgaa ggacgcaatc aagcccaatc 1920 
ttgtccatgc tggcccgttt gccaacatcg 1980 
ggatdgcact caagcttgtt ggcccagaag 2040 
cagacattgg aatggaaaag ttttttaaca 2100 
acgtggtggt gcttgttgcc actgtcaggg 2160 
tcactgctgg actgcctctt cccaaggctt 2220 
aaggcttcag taacttgaag aaacaaattg 2280 
tagtggccgt gaatgcattc aagacggata 2340 
tttccagaga acatggggct tttgatgccg 2400 
agggtgcctt agccctggct caggccgtcc 24 60 
agctccttta tgacctcaag ctcccagttg 2520 
tctatggagc agatgacatt gaattacttc 2580 
cgaagcaggg ctttgggaat ctccccatct 2640 
acaacccaga gcaaaaaggt gtccctacag 2700 
ccagcgttgg ggctggtttt ctgtacccct 2760 
tccccacccg gccctgtttt tatgatattg 2820 
gattattcta aacagatcac catccatctt 2880 
tctattcagg cccactggga gttaggaagt 2940 
ccagaagatc tgaaactaat agtaggagtt 3000 
catcatgtat aaattaacat aaatcatgca 3060 
ataaaaggaa acaagtttgc ca 3112 
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ttgggatcaa agccactcac attaagttac caagaacaac cacagaatct gaggtgatga 300 
agtacattac atctttgaat gaagactcta ctgtacatgg gttcttagtg cagctacctt 360 
tagattcaga gaattccatt aacactgaag aagtgatcaa tgctattgca cccgagaagg 420 
atgtggatgg attgactagc atcaatgctg ggagacttgc tagaggtgac ctcaatgact 480 
gtttcattcc ttgtacgcct aagggatgct tggaactcat caaagagaca ggggtgccga 540 
ttgccggaag gcatgctgtg gtggttgggc gcagtaaaat agttggggcc ccgatgcatg 600 
acttgcttct gtggaacaat gccacagtga ccacctgcca ctccaagact gcccatctgg 660 
atgaggaggt aaataaaggt gacatcctgg tggttgcaac tggtcagcct gaaatggtta 720 
aaggggagtg gatcaaacct . ggggcaatag tcatcgactg tggaatcaat tatgtcccag 780* 
atgataaaaa accaaatggg agaaaagttg tgggtgatgt ggcatacgac gaggccaaag 840 
agagggcgag cttcatcact cctgttcctg gcggcgtagg gcccatgaca gttgcaatgc 900 
tcatgcagag cacagtagag agtgccaagc gtttcctgga gaaatttaag ccaggaaagt 960 
ggatgattca gtataacaac cttaacctca agacacctgt tccaagtgac attgatatat 1020 
cacgatcttg taaaccgaag cccattggta agctggctcg agaaattggt ctgctgtctg 1080 
aagaggtaga attatatggt gaaacaaagg ccaaagttct gctgtcagca ctagaacgcc 1140 
tgaagcaccg gcctgatggg aaatacgtgg tggtgactgg aataactcca acacccctgg 1200 
gagaagggaa aagcacaact acaatcgggc tagtgcaagc ccttggtgcc catctctacc 1260 
agaatgtctt tgcgtgtgtg cgacagcctt ctcagggccc cacctttgga ataaaaggtg 1320 
gcgctgcagg aggcggctac tcccaggtca ttcctatgga agagtttaat ctccacctca 1380 
caggtgacat ccatgccatc actgcagcta ataacctcgt tgctgcggcc attgatgctc 1440 
ggatatttca tgaactgacc cagacagaca aggctctctt taatcgtttg gtgccatcag 1500 
taaatggagt gagaaggttc tctgacatcc aaatccgaag gttaaagaga ctaggcattg 1560 
aaaagactga ccctaccaca ctgacagatg aagagataaa cagatttgca agattggaca 1620* 
ttgatccaga aaccataact tggcaaagag tgttggatac caatgataga ttcctgagga 1680 
agatcacgat tggacaggct ccaacggaga agggtcacac acggacggcc cagtttgata 1740 
tctctgtggc cagtgaaatt atggctgtcc tggctctcac cacttctcta gaagacatga 1800 
gagagagact gggcaaaatg gtggtggcat ccagtaagaa aggagagccc gtcagtgccg 1860 
aagatctggg ggtgagtggt gcactgacag tgcttatgaa ggacgcaatc aagcccaatc 1920 
tcatgcagac actggagggc actccagtgt ttgtccatgc tggcccgttt gccaacatcg 1980 
cacatggcaa ttcctccatc attgcagacc ggatcgcact caagcttgtt ggcccagaag 2040 
ggtttgtagt gacggaagca ggatttggag cagacattgg aatggaaaag ttttttaaca 2100 
tcaaatgccg gtattccggc ctctgccccc acgtggtggt gcttgttgcc actgtcaggg 2160 
ctctcaagat gcacgggggc ggccccacgg tcactgctgg actgcctctt cccaaggctt 2220 
'acatacagga gaacctggag ctggttgaaa aaggcttcag taacttgaag aaacaaattg 2280 
aaaatgccag aatgtttgga' attccagtag tagtggccgt gaatgcattc aagacggata 2340 
cagagtctga gctggacctc atcagccgcc tttccagaga acatggggct tttgatgccg 2400 
tgaagtgcac tcactgggca gaagggggca agggtgcctt agccctggct caggecgtcc 2460 
agagagcagc acaagcaccc agcagcttcc agctccttta tgacctcaag ctcccagttg 2520 
aggataaaat caggatcatt gcacagaaga tctatggagc agatgacatt gaattacttc 2580 
ccgaagctca acacaaagct gaagtctaca cgaagcaggg ctttgggaat ctccccatct 2640 
gcatggctaa aacacacttg tctttgtctc acaacccaga gcaaaaaggt gtccctacag 2700 
gcttcattct gcccattcgc gacatccgcg ccagcgttgg ggctggtttt ctgtacccct 2760 
tagtaggaac gatgagcaca atgcctggac tccccacccg gccctgtttt tatgatattg 2820 
atttggaccc tgaaacagaa caggtgaatg gattattcta aacagatcac catccatctt 2880 
caagaagcta ctttgaaagt ctggccagtg tctattcagg cccactggga gttaggaagt 2940 
ataagtaagc caagagaagt cagcccctgc ccagaagatc tgaaactaat agtaggagtt 3000 
tccccagaag tcattttcag ccttaattct catcatgtat aaattaacat aaatcatgca 3060 
tgtctgttta ctttagtgac gttccacaga ataaaaggaa acaagtttgc ca 3112 

<210> 10 
<211> 1792 
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<212> DNA 

<213> Homo sapiens 
<400> 10 

cgcagcccag actcagactg gggaagcaaa caggggctgg acaggccagg agagcctgtc 60 
ggacagtgat cctgagatgt gggagttgct gcagagggag aaggacaggc agtgtcgtgg 120 
cctggagctc attgcctcag agaacttctg cagccgagct gcgctggagg ccctggggtc 180 
ctgtctgaac aacaagtact cggagggtta tcctggcaag agatactatg ggggagcaga 240 
ggtggtggat gaaattgagc tgctgtgcca gcgccgggcc ttggaagcct ttgacctgga 300 
tcctgcacag tggggagtca atgtccagcc ctactccggg tccccagcca acctggccgt 360 
ctacacagcc cttctgcaac ctcacgaccg gatcatgggg ctggacctgc ccgatggggg 420 
ccatctcacc cacggctaca tgtctgacgt caagcggata tcagccacgt ccatcttctt 480 
cgagtctatg ccctataagc tcaaccccaa aactggcctc attgactaca accagctggc 540 
actgactgct cgacttttcc ggccacggct catcatagct ggcaccagcg cctatgctcg 600 
cctcattgac tacgcccgca tgagagaggt gtgtgatgaa gtcaaagcac acctgctggc 660 
agacatggcc cacatcagtg gcctggtggc tgccaaggtg attccctcgc ctttcaagca 720 
cgcggacatc gtcaccacca ctactcacaa gactcttcga ggggccaggt cagggctcat 780 
cttctaccgg aaaggggtga aggctgtgga ccccaagact ggccgggaga tcctttacac 840 
atttgaggac cgaatcaact ttgccgtgtt cccatccctt caggggggcc cccacaatca 900 
tgccattgct gcagtagctg tggccctaaa gcaggcctgc acccccatgt tccgggagta 960 
ctccctgcag gttctgaaga atgctcgggc catggcagat gccctgctag agcgaggcta 102 0 
ctcactggta tcaggtggta ctgacaacca cetggtgctg gtggacctgc ggcccaaggg. 1080 
cctggatgga gctcgggctg agcgggtgct agagcttgta tccatcactg ccaacaagaa 1140 
cacctgtcct ggagaccgaa gtgccatcac accgggcggc ctgcggcttg gggccccagc 1200 
cttaacttct cgacagttcc gtgaggatga cttccggaga gttgtggact ttatagatga 1260 
aggggtcaac attggcttag aggtgaagag caagactgcc aagctccagg atttcaaatc 1320 
cttcctgctt aaggactcag aaacaagtca gcgtctggcc aacctcaggc aacgggtgga 1380 
gcagtttgcc agggccttcc ccatgcctgg ttttgatgag cattgaaggc acctgggaaa 1440 
tgaggcccac agactcaaag ttactctcct tccccctacc tgggccagtg aaatagaaag 1500 
cctttctatt ttttggtgcg ggagggaaga cctctcactt agggcaagag ccaggtatag 1560 
tctcccttcc cagaatttgt aactgagaag atcttttctt tttccttttt ttggtaacaa 1620 
gacttagaag gagggcccag gcactttctg tttgaacccc tgtcatgatc acagtgtcag 1680 
agacgcgtcc tctttcttgg ggaagttgag gagtgccctt cagagccagt agcaggcagg 1740 
ggtgggtagg caccctcctt cctgttttta tctaataaaa tgctaacctg ca 1792 

<210> 11 
<211> 18596 
<212> DNA 

<213> Homo sapiens 
<400> 11 

cctgtagtcc cagctacgcg agaggctgag gcagcagaat tacttgaacc caggaggcgg 60 
aggttgcagt gagccgagat cgcgccactg cactccagcc tgggtgagag agcgagactc 120 
tgtctcaaaa aaaaaaaaaa aagaccgcca gggctcaaac aaaaaacctc ggaaaagccc 180 
tggcggtctt tttttttttt tttttttttt ttttttggga cagtcttgct ctgtcgccca 240 
ggctggagta caatggtcgg atcttggctc actgcaacct ctgcctccca ggttcaagca 300 
attcttctgc ctcagcctcc caagtagcca ccacgcccag ctaatttttg tacttttagt 360 
agagacgggg gtttcaccat gttgtccagg ctggtcttga actcctgacc tcaggtgatc 420 
cacccgcctc ggccccccaa agtactagga ttacaggcgt gagccaccgc gtccagcgcc 480 
ctggcggttt ttaatcaagt agaaaagctg cattatacca cttgcttcgg ttgcttcagt 540 
gagaacgaag aaatggaaat gcaaatccct tattagttgt aggaaacaga tctcaaacag 600 
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<212> DNA 

<213> Homo sapiens 

<400> 10. 



SSSJ SSSS ~ 9g acag * CC399 * 

cctggagctc attgcctcag IgaStctg SSSS? 3995309990 »W»tgg 120 
ctgtctgaac aacaagtac? cggagggtta SeSSS 9CgCtggagg ccct ^ggtc 180 
ggtggtggat gaaattgagc tgctgtgcca aclcaTaaJ ggggagcaga 240 

tcctgcacag tggggagtca aJgtLagcc ctactSaaa " 95aagcct "gacctgga. 300 
ctacacagcc cttctgcaac ctcaccacca 9 tccGca 9 cca acctggccgt 360 

ccatctcacc cacggctaca t g tctga"cg? £££££ ^tggacctgc ccgatggggg- 420 
cgagtctatg ccctataagc tcaaccccaa aic?SS tCagccac ? t ccatcttctt 480 
actgactgct cgacttttcc ggccacggct att9aCtaca ^agctggc, 540 

cctcattgac tacgcccgca ^agaglggt gSgSaa" Trt^* CCtat ^* ™0 
agacatggcc cacatcagtg gcccgotaoc llrntt t ^ tca ^ac acctgctggc 660 
cgcggacatc gtcaccacca ScScaa 9ttCCCtcgc "ttcaagca 720 

cttctaccgg aaaggggtga aggctgtgga Jcccaaaa? 3 9gggCCaggC "OTOctcat 780 
att t g a g gac cgaatcaact ttgccgtg?* cccatcSS 9gCC9ggaga tcctttacac 840 
tgccattgct gcagtagctg tgqccctaaa ° ca ggggggcc cccacaatca 900- 

ctccctgcag gttctgaaga atgctcggge SKoat tccgggagta 960 

ctcactggta tcaggtggta ctgacaacca "t£to2«- gCCCtgcta 9 agcgaggcta 1020 
cctggatgga gctcgggctg. ag^gggtgct aSSS? ^^^ ^caaggg 1080 
cacctgtcct ggagaccgaa .gtgccatcac tlcocllaln tccatcac =g ccaacaagaa 1140 
cttaacttct cgacagttcc gtgaggat^ cttSS! '^P" 9 1200 
aggggtcaac attggcttag. agjtgaagag caaqaSa? 3 ?" 9 ' ggact "atagatga 1260 
cttcctgctt aaggactcag aaacaagtca StcSr = 9 ^ C399 atttcaaa tc 1320 
gcagtttgcc agggccttcc ccatgcSgg ?t?£2™ aac «caggc Aacgggtgga. 1380 
tgaggcccac agactcaaag ttac?et£? EccSSS? Cattgaa ^c acctgggaaa 1440 
cctttctatt ttttggtgcg ggagggaaga cctctcactt ST 68 * 9 aaata ^ aaa g 1500 
tctcccttcc cagaatttgt aac^gagaag atcttttS JST^? 8 CCag * tata 9 "60 
gacttagaag gagggcccag gcactttctg tttoaa^ "* Ccttttt "ggtaacaa 1620 
agacgcgtcc tctttcttgg ggaagttgag aaa? 3 ™ tgtCatgatc acagtgtcag 1680 
ggtgggtagg caccctcct? 2^SS2 KaltaSa £££££ g-ggcagg 1740 



<210> 11 
<211> 18596 
<212> DNA 

-<213> Homo sapiens 
<400> 11 



SSSS ssss S32S ffiSS t t3Ct c t3aacc C399399cgg 60 

tgtctcaaaa aaaaaaaaaa aagaccgcca Sact^ tgggCgaga 9 agcgagactc 120 
tggcggtctt tttttttttt tt?tttt CCa g ??tt? a3aC 33aaaaCCtC Waaaagccc 180 
ggctggagta caatggtcgg atcttagctc actoS" 3 Cagtcttoct "gtcgccca 240 
attcttctgc ctca£ctS c^gSgcca ccacacc^ " gcctccca ^tcaagca 300 
agagacgggg gtttcaccat gttgtcSg ctaaS^ 39 Ctaattttt 9 tacttttagt 360 
cacccgcctc ggccccccaa Igtfctagga ^Scagacq 3 ! CtCCtgaCC «=C«Wtg«tc 420 
ctggcggttt ttaatcaagt agaaaagcK catta?^ 9agCCaccgc gtccagcgcc 480 
gagaacgaag aaatggaaat gcaaatccct tlttaattat Cttgcttcgg "gcttcagt 540 

g aacccct tattagttgt aggaaacaga tctcaaacag 600 
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cagttttgtt gacaagaccg caggaaaacg 
gcggtcgacc agacggttcc caaagggcgc 
ggttcccggg tttcctaaga ctctcagctg 
gtggctcctg cgtttccccc tggcgcacgc 
cgagcaggaa gaggcggagc gcgggacggc 
ccaccgcgcc acttggcctg cctccgtccc 
cgcgccactt cgcctgcctc cgtcccccgc 
gctgccgcgc cggcccttgc cccccgccgc 
gcacggggag ctgcagtacc tggggcagat 
ggacgaccgc acgggcaccg gcaccctgtc 
gagaggtgac gccgcgggcc cctgcgggac 
ggagagcgct cgggagctgc cgggcgctgc 
ccagggaggg gacgcatcgt cctcctcgcc 
gggacgtgac tggcgcgggc aacacacaca 
cagcggctcc gcggccgggc tcgcagtcgc 
gtcatcgggc agcgtttgcc cagtgctgga 
cgcgccggtc tcaaagtcct ggctttggcc 
gcttcgcagc gttttcaaaa actggagcga 
aagaggacag cactgaagct ggcgcgggaa 
cccacgaacc agctttcctc ttaaaccttg 
gtcgtccttt cctctttcct ttccgacagg 
cattggcgcc aggctttcag gggacagtgg 
cagccgttgg ccctccctaa ggccacaccg 
gcgggggagg ggactcgaag gtgtgtgagc 
ggagcgcagc cttgacacag gggtggaggt 
aagcagattc actgtagcta gcggaaaagc 
atacatagca gctgctgtgg ctgattggcg 
gagggatcca cttttctgca gctccaagcc 
ttttaaccca cctttctgct tagacattga 
tgttaaaaat ggagaaaata atgcttagta 
aaatctctct gttgtctgaa tataaactgt 
gcatttaaag ataagttttt acactttcat 
cctttttgtt cctgggacac atgcttggtg 
actagtgggc agtggatgat agctcctcct 
gccaccatct cactgtgtgg aattcctgtg 
tctcagcctg aatcgggcca ctgatgggac 
ggcccacttt gcgggagtct aggctttctg 
tttcttccct gaagaaagtt ggatggcatg 
aaattgtttt tcagatgaat tccctctgct 
tttggaggag ttgctgtggt ttatcaaggt 
tctgttctca acacagcagc cagtgagatc 
ggctcacgcc tgtaatccca ccgctttggg 
tttgggacca gcctggccaa catggcgaca 
agccaggtgt gctggtgcat gtctgtaatc 
ttgctcacga ggcggaggtt gtagtgagct 
acagagggag aacccatgtc aaaaacaaaa 
ttcctcaccc tcaagccctt agtggctcca 
gtgtccgttt ttcagctctg accttagctg 
gagtttttga gcacaccggg acatccccac 
ttcttccttt gagtctctac tccactcggg 
tgtgactctc ccccaacctc cttggtgttt 



tgggaactgt gctgctggct tagagaaggc 660 
agtccttccc agccaccgca cctgcatcca 720 
tggccctggg ctccgttctg tgccacaccc 780 
tctctagagc gggggccgcc gcgaccccgc 840 
cgcgggaaaa ggcgcgcgga aggggtcctg 900 
gccgcgccac ttggcctgcc tccgtcccgc 960 
ccgccgcgcc atgcctgtgg ccggctcgga 1020 
acaggagcgg gacgccgagc cgcgtccgcc 1080 
ccaacacatc ctccgctgcg gcgtcaggaa 1140 
ggtattcggc atgcaggcgc gctacagcct 1200 
gggtggcggg aaggagggag gcgcggctgg 1260 
ggaccccgtt tagtcctaac ctcaatcctg 1320 
ttacagacgc cgaaacggag ggtcccatta 1380 
gcagcgacag ccgggaggta agccgcgtcc 1440 
cccagtgatg ccgtggcccc cgaggcgggc 1500 
gggttaggga gagctgcctg ggcttgaccg 1560 
cctcctccgt tttcccctgt ggaccattcc 1620 
aagtgatgtg ggcggggcaa aggcggcggg 1680 
cttggtttcc tggtggcctc ccatccaatc ,1740 
aaaagagaaa ttcgggagtt cgagttctta 1800 
agcaccccag gcaaaaaatg tctcgcgggt 1860 
ggcggggcgg ggtgggcaca ggacgttagg 1920 
tcctgccgtc ctggatcctg cgccagctgc 1980 
caggggctga ccttgaccgc tcagataaat 2040 
ggttttgaat ggggaaaccc attcgtggtg 2100 
cctccggccc acggacccat ctagagacga 2160 
tgggacagcg tggggagttt tgtctgagga 2220 
caggggcctt tgatgagcca tagacctcat 2280 
gcaagttact tctcatatag cttccctata 2340 
ggcaattctg ataaaagcag gtgcttgcaa 2400 
accacaagcg agtgcggatg aacgaggact 2460 
ttctctgtgg ctcgacactt ctgatgcctc 2520 
ttgtcttcac acctttgtga caggattagc 2580 
cccttttgcc acatgttcat ccctgccctc 2640 
tccactggtc accggggcac agaagtgctg 2700 
ttgcagcctg ggagctccac cgtgatctct 2760 
gatgctccag gcctcacgtc ccagggcagt 2820 
atctgtcttc ccatcttgaa accgtatggc 2880 
gacaaccaaa cgtgtgttct ggaagggtgt 2940 
aaagaagtcg ctgctattag aagtcagtag 3000 
ctttcaaaac tcaaagcagc caggtgtggt 3060 
aggctgagtc agatcacctg aggttaggaa 3120 
ccccagtctc tactaataac acaaaaaatt 3180 
ccagctactc aggaggctga ggcatgagaa 3240 
gagatcgtgg cactgtactc cagcctggcg 3300 
aaagacacca ccaaaggtca aagcatatca 3360 
tttcactcag taagagccac ggtccttatg 3420 
ctgctctctg caccaccctg ctgttcttgt 3480 
tccctggaac cttcttcccc cacacttggc 3540 
caagccttcc tagacctcct gatttaaaac 3600 
ctccgtagac gaacatcacc atctgatgta 3660 
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ss;s ss r t9ct " ct ™ 

Sgttcccggg tttcctaaga ctctcagcto SSS agCcaccgca cctgcatcca 720 
gtggctcctg cgtttcccL tggcgcacg? tcSSS CtCCgttctg ^ccacaccc 780 
cgagcaggaa gaggcggagc gcgggacggc S 9 g 999Sc C gcc gcgaccccgc 840 
ccaccgcgcc acttggcctg ccSSSS IS*"** Wfcgcgegga aggggtcctg 900 
cgcgccactt cgcctgcct? cgtcccccgc cc22 ^ ttggcctgcc tccgtcccgc 960 
gctgccgcgc cggcccttgc cSccgccgc 3tgcct ^ ccggctcgga 1020 

gcacggggag ctgcagtacc tggggcagft ccaacacS gaCgccgagc cgcBteegcc 1080 
ggacgaccgc acgggcaccg gJaJcctgtc ggtatt™ C ' CCgctgc 9 gcgtcaggaa .1140 
gagaggtgac gccgcgggcc cctgcgggac SgSJSE f 9CaggCgc gctacagcct 1200 
ggagagcgct cgggag Ctgc cgggc£?g1 2Sccc2S a3ggagggag ^^gcggctgg 1260 
ccagggaggg gacgcatcgt ccScJcJcc SacaoSJ' tagtCCtaac ctcaatcc.tg 1320 
gggacgtgac tggcgcgggc aacacacaca Jcagcgacaa ^ 3aaCggag 1380 
cagcggetcc gcggccgggc tcgcagtcgc cccfSaa^ f^ 93 "^ ^cgcgtcc 1440 
gtcatcgggc agcgtttgcc cagtgctgga Sa22££f CC9t99CCCC =S»Wcgggc 1500 
cgcgccggtc tcaaagtcct ggctttgacc ?S " 9 Wtgcctg ggcttgaccg 1560 
gcttcgcagc gttttcaaaa Ictggag?ga alTtTtT tttCCCCtgt W«ccattc? 1620 
aagaggacag cactgaagct ggcg^gggaa ffgcjgggeaa aggcggcggg 16B0 

cccacgaacc agctttcctc ttaaacc?S aaalo™ * ggtggcctc ccatccaatc 1740 
gtcgtccttt cctctttcct ttccgacagg JgSSSJ "* gggagtt -gagttctta 1800 
cattggcgcc aggctttcag gggacaat™ 39 9 gcaaaaaa tg tctcgcgggt 1860 

cagccg ttgg ccctcccta^ g^a^'- "f^f* 1920 
gcgggggagg ggactcgaag gtgtgtgag? caSac?^ CtggatCctg <*ccagctgc 1980 
ggagcgcagc cttgacacag gggtggaggt S????" 9 * ^cagataaat 2040 

aagcagattc actgtagcta gcgglaaagc 9gggaaacc <= attcgtggtg 2100 

atacatagca gctgctgtgg ctgattgg^g tgggSS aC9gacccat "agagacga 2160 
gagggatcca cttttctgca gctccaagc? Sg^tt S??™*" ^ctgagga 2220 
ttttaaccca cctttctgct tagacat?ga gcaagttact 5 gatgagcca tagacctcat 2280 
tgttaaaaat ggagaaaata atgcttagS ggcaa"c?a 'f Catatag "tccctata -2340 
aaatctctct gttgtctgaa tataaac^gt accacaaaS f aaaagcag **cttgcaa 2400 
gcatttaaag ataagttttt acactttcat ttctctS agtgcggat 9 aa cgaggact 2460 
cctttttgtt cctgggacac atgcttggtg EtafccSSr CtC ? acactt ct 9 at 9<=«c 2520 
actagtgggc agtggatgat agctccSc! ccKtS aCC " tgtga "Wattage 2580 
gccaccatct caccgtgtgg aattcctgtg tccactStc ac,,tfftte * t ccctgccctc 2640 • 
tctcagcctg aatcgggcca ctgatgggac S acc 39gg<=ac agaagtgctg 2700 
ggcccacttt gcgggagtct aggctt?ctg aataS? 9gagctccac cgtgatctct 2760 
tttcttccct gaagaaagtt ggatggcatg JtcSS? 9 9CCtCacgtc ^agggcagt 2820 
aaattgtttt tcagatgaat SccS^rt gacaacca^ CCatCttgaa «**tatggc 2880 
tttggaggag ttgctgtggt ttateaaggt Ea^^ ■f? tB * ttc£ 9gaagg ^t 2940 
tetgttctca acacagcagc cagtgaga^c S " gCtattag aa gtcagtag 3000 
ggctcacgcc tgtaatccca ccgc^t^ggg aggctaaotc tCa3agCagc caggtgtgtft 3060 
tttgggacca gcctggccaa ca?gg C gaca ccccaaSc aga , tCacctg a ggttaggaa 3120 
agccaggtgt gctggtgcat gte™tCS ccagctact ■ taCtaataac a <= aa aaaa t t 3180 . 
ttgctcacga ggcggaggtt gtagtgagct gaaatcat™ a " aggct 9 a ggcatgagaa 3240 
acagagggag aacccatgtc aaaaaLaaa aaagacaS f Ctgtactc ^cctggcg 3300 
ttcctcaccc tcaagccctt agtggctcca ttKJr^ f aaag ^ tca aagcatatca 3360 
gtgtccgttt ttcagctctg a?c??agctg ctac?Sr? S t3agagccac ggtccttatg 3420 
gagtttttga gcacaccggg acatccccac t2££2£ Ctgttc "gt 3480 

ttcttccttt gagtctctac tccactcggg caagc^tcr " tCttCCCC cacacttggc 3540 
tgtgactctc ccccaacctc cttggt g??? "™ gat.aaaac 3600 
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tgtcagcctt tcccttcccc tgttagaagg 
tgtaagcttt atgagggcag aggatttgtt 
tcaaacacag cctgccacat agtaggagtc 
acctgtgttc ccatgttcat ataaattcta 
cttccagagg gctgagtagg tacctcaggc 
gcccagacca agcttctctg tgttccgtgt 
accagtgctt tttggaaggt ggctgaggtc 
gggatccctt ctatagagca cttggtatgg 
agccctctta tcccatttta gaaagggctt 
gctctgtatt cttggggtat tttgtt.ctgt 
ggatttgaaa catatcatca gaaacctcag 
ggtctgaatg gatgaaaagg agagactttt 
ttgttggaaa tttctcaatt ggagaaattt 
ttctcaatca aagaaaatcg ggacaaactc 
agaaataaaa tgcattcccc tgtttcaact 
cttgccagga atgtcagact caccttgata 
aatacaggag ttgaggcacc tgcactgtaa 
cctgtatgcc aggtagaaga tacattgaac 
gggtattaca tatagacaac acatttcatt 
tacttaaaag cacttggagg tgtttagcct 
agatctggga gggtggttcc aaagaaggga 
ggctgggacc acctacttaa agataggatg 
atcccagcac ttcgggaggc cgaagcgggc 
gcctgaccaa catggagaaa cgctgtctct 
ggcgcatgcc tgtaatccca gctactcggg 
ggaggtggag ggtgccgtga gccacgatcg 
gcgaaactct caaaaaacaa aaaaaaggat 
cctcctagca agtcagcagg ggccagaggc 
tgagatggct taagatttac ctggatgcct 
cacaaatgct aaagagctgt cttccaaggg 
agactttttg gacagcctgg gattctccac 
tggcttccag tggaggcatt ttggggcaga 
agaacaatgc cttccatttc cgggtgccct 
ataaggtctg ggggatgagt caatgtcaca 
gggtggtgcc aggttgaagc cacaattaac 
tttttttttt ttttaattat tatactttaa 
ttagttacat atgtatacat gtgccatgct 
catcaggtat atctcccaat gctatccctc 
agtgtgatgt tccccttcct gtgtccatat 
agaatatgcg gtgtttggtt ttttgttctt 
tttcaccacg tccctacaga ggacatgaac 
atggtgtata tgtgccacat tttcttaatc 
tccaagtctt tgcctattgt gaatagtgcc 
tagcagcatg atttaatagt cctttgggta 
tggtatttct agttctagat ccccgaggaa 
tagtttacag tcccaccaac agtgtcaaag 
ctgttgtttc ctgacttttt aatgattgcc 
gtggttttga tttgcgtttc tctgatggcc 
ttggctgcat aaatgtcttc ttttgagaag 
atggggttgt ttttttctta taaatttgtt 
ctttgtcaga tgagtaggtt gcaaaaatgt 



gggacagcag gtagtaaaag tgaaatgtgc 3720 
tctcgtgttc actgttgtat cgccagggcc 3780 
aacatatatt gatcactaaa tgtagatacc 3840 
gaagagtctc ttcagtaaca aggtgaaccc 3900 
cggggccaga gtgctgtgaa gacagcagca 3960 
cctggtctag aaccagcgat gttctttctg 4020 
tgggctcagg tctgggccat actagaagct 4080 
cttgtatggt cttggggcaa gccagaccca 4140 
caatttggat ccagccccag gtctgcctta 4200 
attggcctat cttgactaac aatgagcctt 4260 
aagacaacat tcttaaactg gctagagcct 4320 
gaagcaatat gtaaaagatt gagaaatgat 4380 
ctttgatttg ttggaaattt ctttgattct 4440 
aacaatagaa agggaggaag caagatactc 4500 
taatgcttca attcaggatt ctaaggaatc 4560 
gttggagtta ctccattggt gactcgatca- 4620 
aatactgatt agtctgatca ttaggaatat 4680 
agattgcatg taggcattaa attcattttg. 4740 
aagaaacata aaactgtcag atcggtggaa 4800 
aaaaagctta gttgagggga atggaagaaa 4860 
tcagactatc ctaaagccct caggaatctg 4920. 
ggcagctggg tgtggtggct cacgcctgta 4980 
ggatcacctg aggtcaggag ttcgaggcca 5040 
actaaaaata caaaattagc tgggtgtagt 5100 
aggctgaggc aggggaatcg cttgaacctg 5160 
cgccattgca ctccagcctg ggcaacaaga 5220 
gggttccata tgggtggtgt caagtgccca 5280 
ccttgtaagt ggtgtctcgg ggggatcaac 5340 
gctctgctct ccccatctct tccagggatc 5400 
agtgaaaatc tgggatgcca atggatcccg 5460 
cagagaagaa ggggacttgg gcccagttta 5520 
atacagagat atggaatcag gtgaggagat 5580 
tcctagcacg tgtttgctcc gttgttttag 5640 
ggagctgatg tatagctttg accttgtgag 5700 
gcctactgaa ggccgtttca catctttttt 5760 
gttttagggt acatgtgcac aatgtgcagg 5820 
ggtgcgctgc accactaact caccatctag 5880 
ccccctcctc ccaccccaca acatccccag 5940 
gttctcgttg ttcgattccc actatgagtg 6000 
gcgatagttt actgagaatg atgatttcca 6060 
tcatcatttt ttatggctgc atagtattcc 6120 
cagtctatca tgttggacat ttgggttggt 6180 
acaataaaca tacgtgtgca tgtgtcttta 6240 
tatacccagt aatgggatgg ctgggtcaaa 6300 
tcgccacact gacttccaca atggttgaac 6360 
tgtcctattt ctccacatcc tctccagcac 6420 
attctaactg gtgtgagatg gtatctcatt 6480 
agtgatggtg agcatttttt catgtgtttt 6540 
tgtctgttca tgtccttcgc ccactttttg 6600 
tgagttcatt gtagattctg gatattagcc 6660 
tctcccattt tgtgggttgc ctgttcactc 6720 
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2£SSt ISSSS 2SSS SS"£ 9t - taaaag , 720 

tcaaacacag cctgccaca? ag^aggagtc aSKtatt ^ 3780 
acctgtgttc ccatgttcat ataaSSta oaa^otS^ gatcactaa * tgtagatacc 3840 
cttccagagg gctgagtagg tacctSgc cggggccaoa atac?^* 39gtgaaccc 3900 
gcccagacca agcttctctg tgttccgtqt ccS^? gacagcagca 3960 

accagtgctt tttggaaggf ggctgaggS 2222 aa ' Cagcgat ffttctttetg 4020 
gggatccctt ctatagagca cltggtSgg SSSSS ^ aeaat a «agaagct 4080 
agccctctta tcccatttta gaaadggc?? caa?tS Ctt9gggc:aa 9=cagacc G a 4140 
gctctgtatt cttggggtat tttg?t?tgt a <=cagccccag gtctgcctta 4200 

ggatttgaaa catatcatca gaaacctcag aSSSE *" 9actaac aatgagcctt 4260 
ggtctgaatg gatgaaaagg agagactttt Kaacaa^ t f taaact ^ BCtagagcct -.4320. 
ttgttggaaa tttctcaat? ggagaaattt £££££ ?" aaaagatt ^gaaatgat 4380 ' 
ttctcaatca aagaaaatcg gg a ?aaac" «c2S2- ttggaaat " ctttgattct- 4440 
agaaataaaa tgcattcccc ^gtttcaact taatact?« f 9999 " 999 "agatactc 4500 . 
cttgccagga atgtcagact caccttgata a " Caggatt ctaaggaatc 4560 

aatacaggag ttgaggcacc tgcac^aa f^f ^^ gactcgatca 4620 

cctgtatgcc aggtagaaga tacattgaac agaltg^ t^"^" ttaggaatat ««0 
gggtattaca tatagacaac acatttLtt aagaaacata aS^ attcatt t,tg 4740 
tacttaaaag cacttggagg tgtttaqcct lllttn^ aaactgtca 9 atcggtggaa 4800 
. agatctggga gggtgg?tc? aaagaaggga tcagaStc * ttgagggga a ^gaagaaa 4860 
ggctgggacc acctacttaa agaLacaS ™ „ t ct ***9cc C t caggaatctg 4920 
atcccagcac ttcgggaggc SSJSgg? SSSS a tgtggtggct eacgcctgj 4980 
gcctgaccaa catggagaaa cgct'St S2iS2 a " tCaggag "cgaggcca 5040 
ggcgcatgcc tgtaatccca g?taSggg Caaaatta 9<= tgggtgtagt 5100 

ggaggtggag ggtgccgtga gccac^S ZTclttart ! 99ggaatc 9 "tgaacctg 5160 
gcgaaactct caaaaaacaa LaaaagS gggttc^ta IS^T? " CaaCaaga 5220 
cctcctagca agtcagcagg ggccagaggc S?tot2o? * ggg * 9gtgt ««Stgccca 5280 . 
tgagatggct taagatttac c?g ga £St gctcSSS 9gtgtctc 99 ggggatcaac. 5340 
cacaaatgct aaagagctgt ctKaggg agSaatc tal?^ tcca ^gatc -5400 
agactttttg gacagcctgg gattctccac lllllt^ tgggat sr cca atggafccccg 5460 
tggcttccag tggaggca?? ?tg£gS£ SEESS ItTT" 9 gCCCagt "a 5520 . 
agaacaatgc cttccatttc cgg^JcSt SS^n ? tg f aat «g gtgaggagat 5580 
ataaggtctg ggggatgagt caatgJcaca g£gc?gS 2S2?J? C 9ttgttttag 56« . 
gggtggtgcc aggttgaagc cacaattaac acc?«?™! tatagc ^9 accttgtgag 5700 
tttttttttt ttttaattat tatacfcttaa SttSf? 9gccgtttca "tctttttt: 5760 
ttagttacat atgtatacat gtgccatgc? aCat9tgCaC aa ^9cagg 5820 

catcaggtat atctcccaat gctatccctc S2 accacta *« caccatctag 5880 
agtgtgatgt tccccttcct gtgtccatat ottctS* ^ aCCCCaca ^atccccag 5940 
agaatatgcg gtgtttggtt StbtSSS "^ 9attCCC ^tatgagtg 6000 . 

tttcaccacg tccctacaga ggacatgaac ?cS2KS f^^aatg atgatttcca 6060 
atggtgtata tgtgccacat tttcSatc t ttat 9^tgc atagtattcc 6120 

tccaagtctt tgc?tattg t gaatagtgcc aSat^f 3 f 9ttggacat ttgggttggt 6180 
tagcagcatg atttaatagt cc"tgggta ITtT^ t tacgtgtgca tgtgtcttta 6240 
tggtatttct agttctagat ccccgaggal tcg'cSt ES""** Ctgggtcaaa "00 
tagtttacag tcccaceaac agtgLSag ScctaSt Itrrt^ atggtt ^ac 6360 
etgttgtttc ctgacttttt aatgattgcc a?2Sic2 tctccagcac 6420 

gtggttttga tttgcgtttc tctgatggcc StSJ^S 9t9t ? agatg Statctcatt 6480 . 
ttggctgcat aaatgtcttc ttt ? tgtctgtS "atcc^"" "^ttt 6540 

atggggttgt ttttttctta taaatttgtt Sa2?2^ tgtCC,;tcgc ccactttttg 6600 
ctttgtcaga tgagtaggtt gcaaaaatgt tctcccattt Saa^f 9 gatattagcc *660 

y cccacct tgtgggttgc ctgttcactc 6720 
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tgatggtagt ttcttttgct gtgcagaagc 
ttttggcttt tgttgccatt gcttttggca 
cctgaatggt aatgcctagg ttttcttcta 
agtctttaat ccatcttgaa ttgatttttg 
gctttttaca tatggctagc cagttttccc 
cccattgctt gtttttctca ggtttgtcaa 
tttctgaggg ctctgttctg ttccattgat 
ctgttttggt tactgtagcc ttgtagtata 
ctttgttctt ttggcttagg attgacttgg 
actttaaagt agttttttcc aattctgtga 
cattgaatct ataaattacc ttgggcagta 
cccatgagca tggaatggtc ttccatttct 
gtttgtagtt ctccttgaag aggtccttca 
ttattctctt tgaagcaatt gtgagtggaa 
tgttattggt gtataagaat gcttgtgatt 
tgctgaagct gcttatcagc ttaaggagat 
atacaatcat gtcgtctgca aacagggaca 
cctttatttc cttctcctgc ctaattgccc 
ggagtggtga gagagggcat ccctgtcttg 
tttgcccatt cactatgata ttggctgtgg 
aatatgttcc atcaatacct aatttattga 
tgtcaaaggc tttttctgca tctattgaga 
ttatatgctg gattacattt attgatttgc 
tgaagcccac atgatcatgg tggataagct 
gtattttatt gaggattttt gcatcaatgt 
ttttggtgtg tctctgccca gctttggtat 
agggaggatt ccctcttttt ctattgattg 
ctctttgtac ctctggagaa ttcggctgtg 
ggtaagctat tgattattgc cacaatttca 
acttcttcct ggtttagtct tgggagagtg 
agattttcta gtttatttgc gtagaggtgt 
tctgtgggat cggtggtgat atccccttta 
tctttttctt tattagtctt gctagcggtc 
agctcctgga ttcattaatt ttttgaaggg 
ctctgatttt agttatttct tgccttctgc 
ctagttcttt taattgtgat gttagggtgt 
gggcatttag tgctataaat ttccctctac 
ggtatgttgt gtctttgttc ttgttggttt 
cgttatgtac ccagtagtca ttcaggagca 
tttgagtgag attcttaatc ctgagttcta 
ttgttataat ttctgttctt ttacatttgc 
cggttttgga ataggtgtgg tgtggtgctg 
ggagttctgt agatgtctat taggtctgct 
tccttgttga ctttctgtct cgttgatctg 
ccattattaa tgtgtggagt ctaagtctct 
tgggcgcttg gcactttcca tactgtgtca 
gctggggaat gggaagttca tcggtgggac 
ggctttaatc tccctttcga ggctgagcca 
cgcagtacca gggtcacact ccactcccag 
ctcactcctg gacttccact ttcctgtcac 
gagcccagcc cctcccacct gtgcctagga 



tctttagttt aattagatcc catttgtcaa 6780 
taggcatgaa gtccttgccc atgcctatgt 6840 
gggtttttat ggttttaggt ctaacgttta 6900 
tataaggtgt aaggaaggga tccagtttca 6960 
agcaccattt attacatagg gaatcctttc 7020 
agatcagata gttgtagata tgcggcgtta 7080 
ctatgtgtct gttttggtac cagtaccata 7140 
gtttgaagtc aggtagcgtg atgcctccag 7200 
cgatgcgggc tcttttttgg ttccatatga 7260 
agaaagtcat tggtagcttg atggggatgg 7320 
tggccatttt cacgatattg attcttccta 7380 
ttgtatcctc ttttatttca ttgagcagtg 7440 
catccctttt aaggtggatt cctaggtatt 7500 
gttcactcat gatttggctc tctgtttgtc 7560 
tttgcagatt gattttatat cctgagactt 7 620 
tttgggctga gacaatgggg ttttctagat 7 680 
atttgacttc ctcttttcct aattgaatac 7740. 
tggccagaac ttccaacact atgttgaata 7800 
tgccagtttt caaagggaat gcttccagtt 7860 
ctttgtcata gatagctctt attattttga 7920 
gagtttttag catgatgtgt tgttgaattt 7980 
taatcatgtg gtttttgtct ttggatctgt 8040- 
gtatattgaa ccagccttgc atcctaggga 8100 
ttttgatgtg ctgctggatt cggtttgcca 8160 
tcatcaagga tattggtcta aaattctctt 8220 
caggatgatg ttggcttcat aaaatgagtt 8280 
gaatagtttc agaaggaatg gtaccagttc 8340 
aatccatctg gtcctggact ctctttggtt 8400 
gctcctgtta ttggtctatt cagagattca 84 60 
tatgtgtcaa ggaatttatc catttcttct 8520 
ttgtagtaat ctctgatggt agtttgtatt 8580 
tcatttttta ttgcgtctat ttgattcttc 8640 
tataaatttt gttgatcctt tcaaaaaacc 8700 
ttttttgtgt ctctatttcc ttcagttctg 8760 
tagcttttga atatgtttgc tcttgctttt 8820 
caattttgga tctttcctgc tttctcttgt 8880 
acactgcttt gaatgtgtcc cagaggttct 8940 
caaagaacat ctttatttct gccttcattt 9000 
ggttgttcag tttccatgta gttgagcagt 9060 
gtttgattgc actgtggtct gagagatagt 9120 
tgaggagagc tttacttcca actatgtggt 9180 
aaaaaaatgt atattctgtt gatttgggat 9240 
tggtgcagag ctgagttcaa ttcctgggta 9300 
tgtactgttg acagtgggtg ttaaagtctc 9360 
ttgtaggtca ctcagatgat tggcacttac 9420 
tcggcagata gctgcatggt tggtgttcgt 9480 
aaggacaaaa tgcccccatt gctttgttgt 9540 
cagcgtgctg taggtggcgc tgctgtgaag 9600 
ctctgcagag gtggagaaag aatgaaacat 9660 
tgttggtgtc acctcttact ggatgtcaca 9720 
aaagcagatg ccaccttgga atgtggggtt 9780 
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' SS^SS aa ttag a tcc ca tttgtcaa 6780 

cctgaatggt aatgcctagg ?tttct?cta ggaltttllt atgcctatg t 6840 

agtctttaat ccatcttgaa ttgatttttg £S2££ 9gttttaggt ^^aacgttta 6900 
gctttttaca tatggctagc cagttttS lac^ff^ aa ^ aa 99^ tccagtttca 6960 
cccattgctt gtttttctca ggfcttgtcaa lalt,, \ attaca tagg gaatcctttc 7020 
"tctgaggg ctctgttctg ttccaSaJ Stgtatc? tgcggcgtta 7080 

ctgttttggt tactgtagcc ttgtacrtata "?J gtgtct Sttttggtac cagtaccata 7140 
ctttgttctt ttggcttagg attgactS S^ 9 **^ aggtagcg t g atgcctccag 7200 
actttaaagt .gttttttcc SScat 5 C \ tttttgg "ccatatg? 7260 

cattgaatct ataaattacc ttgggcagta tggtagct tg atggggatgg 7320 

cccatgagca tggaatggtc ttSS£? cacgatattg attcttccta 7380 

gtttgtagtt ctccttgaag aggtccttca calccctttt 111 f ttgagcag ' g : 7440 
ttattctctt tgaagcaatt gtgagtggaa g^cactcat aaggtggatt "taggtatt 7500 
tgttattggt gtataagaat gcttgtgatt ?t2aS S 9 " C tctgt "gtc 7560 
tgctgaagct gcttatcagc ttaaggagat tttoS?S 9attttatat ^tgagactc 7620 
atacaatcat gtcgtctgca aacagggaca atSacS SSS™ ttttCCagat '"0 
cctttatttc cttctcctgc ctaattqccc tSS ctcttttcct aattgaatac 7740 

ggagtggtga gagagggclt ccctgStg tgS?*^ ttCCaaCact ^"gaata 7800 
tttgcceatt- cactatgata ttggctgtgg c?ttat^ caaaggga *t gcttccagtt 7860 
aatatgttcc atcaatacct aaStattga S^ttta* ITT aCtattttga ^920 
tgtcaaaggc tttttctgca tctattgaqa tS^^^ 9 ' catgatgtgt tgttgaattt 7980 
ttatatgctg gattacattt attgaStgc g^ta'toaf f ttttgtCt "ggatctgt ,8040 
tgaagcccac atgatcatgg tggataagct ttttaatat* ^ agCCttgc atcctaggga 8100- 
gtattttatt gaggattttt gcatcaatgt tcaSaS EST**"" Cggtttgcca 8160 
ttttggtgtg tctctgccca gctttggtat cagJatoKa ! attggtCta aa "tctctt 8220 
■ agggaggatt ccctcttttt ctattgattg " ggGCtcat aaaatgagtt 8280 

ctctttgtac ctctggagaa ttcggctgtg 2£2£2 a ? aa ? gaatg ^ccagttc 8340 
ggtaagctat tgattattgc cacStt^ca gctcct^ta CtCtttggt = 84 °° 

. acttcttcct ggtttagtct tgggagagtg ?ataSS£ ttggtctatt cagagattca 8460 ■ 
agattttcta gtttatttgc gtfgagg?g? 9 ? aatttatc "ttMttct 8520 

tctgtgggat cggtggtgat atc^S S ' tCtgatggt ^"tgtatt 8580 
tetttttett tattagtctt gctagcggtc tataaatJtt l^T^ tCgattcttc «««<> 
agctcctgga ttcattaatt ttttgaaggg t^ttt^r gtt ff«t tcaaaaaacc 8700 
ctctgatttt agttatttct tgcc?tc?re taoctt^L C h tC f atttcc tt cagttctg 8760 
ctagttcttt, taattgtgat gttagggtgt caatttS, * t ? tgtttgc tcttgctttt 8820 
gggcatttag tgctataaat ?tccStac acact^cttt ITH^ tttctc " g ' 8880 
ggtatgttgt gtctttgttc ttgttggtJt Jaaaaaa^J gaatgtgtcc ^agaggttct 8940 
cgttatgtac ccagtagtca ttLgglgca ggttgttcac ^" atttCt g =cttcatt t 9000 
tttgagtgag attcttaatc ctgagttcta llt^lT, 9 tttccatgta Sttgagcagt 9060 
ttgttataat ttctgttctt tt2£2£ aC , tgtggtct ^gagatagt 9120 

cggttttgga ataggtgtgg tgtggtgc^g SaaaaStat Z T^* actatgtgg t 9180 
ggagttctgt agatgtctat tagScS? tggtacS* atattctgtt gatttgggat 9240 
tccttgttga ctttctgtct cg??g at ctg S£S Ctgagttcaa "cctgggta 9300 
ccattattaa tgtgtggagt ctaagtc^c? S^aa^ 9 3 f gtgggtg "aaagtctc 9360 
tgggcgcttg gcactttcca tactgtgS tcgocaaat 2 ^" 9atgat tggcacttac 9420 
gctggggaat gggaagttca .tcggjgggac aaSacaaa 3 9CtgCatggt tggtgttcgt 9480 
ggctttaatc tccctttcga ggc?glg?ca caocatac^ ^ 9CCCCCatt gcttttfttgt 9540 
cgcagtacca gggtcacact ccac?c 9 cag ctctccf.S ta " tggcgc tgctgtgaag 9600 
ctcactcctg gacttccact ttcctgtca? StS^ 9t9gagaaag ^tgaaacat 9660 
gagcccagcc cc.cccacct gtgcc?^ 9gatg ^ a « 9720 

y y Lg ccaccttgga atgtggggtt 9780 
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tgtgtgtgca atttactagc tgggcagaga ccagcaacct ggagagcagg tgtctcgtct 9840 
aaggggacag tcacatttca cctccagcca cctggaggaa tttgggcctg gtgatgtcag 9900 
aattcttcaa taaaagccta aaatctatat tttatgtgcg gtcatgagat ctgttaaatg 9960 
ttagcaactt caggaagttt aaaaatgctg tgtggaccta gaataggcaa gttcttaaag 10020 
gcagaaagtg gaatgctagt ttccagggac tggggaacag ggaggaatgg ggagttcatg 10080 
tttaatgggc acagaggttt tgttagggat gacgaaaaag ttcgggagat ggtgatggtg 10140 
atggagatgg tgatggtgat ggagatggtg atggtgatgg tgatggtgat gggtgatggt 10200 
gatggtgatg gtgatggtga tggagatggt gatggtgatg gtgatggaga tggtgatggt 10260 
gatggtgatg gtgatggaga tggtgatggt gatggagatg gtgatggtga tggtgatgga 10320 
gatggtgatg gtgatggtga tggtgatggt gatggtgatg gtgatggaga tggagatggt 10380 
gatggtgatg gttgcctaac atcaggaacg tgcttaatgc ttctgaattg cacacaaaaa 10440 
tggcaagttt aatattatgt gtactttatc acaatgaaaa aagctgctgc gtgggccaag 10500 
ttacttgtgc aggtaatgtt ctgcaggtgg ttgcctgcac ctcagttgta gggtgtccgt 10560 
aggatgtgag gccagtcccc gggcttaatg atgctttaaa tcctgcctag tattcaatta- 10620 
tttcttgtcg cttaaaaggc ctaataaaat tatggtctta gtttacagtg gtatgaatgc 10680 
ttagctgttg gattttagta ggaaagttcg tccctttttg tttttaattt tgttttacag 10740 
attcacagga attttttttt tttttttttt tttttttttt taatgcacag aaagtttccc 10800 
tggactctct acccagtttc cccagtgata atatcttggg taacatcctg tatacattca 10860 
cattggtgca ttcctcagag ttgtcagatt ttgctagttt tacgtgcact tgtgtatgtg. 10920 
tgtatttgca attttagcac gtgtagactc ttgtaaccac tacaatcaag ttacagaact 10980 
acactaccaa ggttcatctt tttaaaatct ttgatgttac cttttttgga acagtgacca 11040 
tgagaggact ttcctcccaa aattttgaaa actactgaac cagaatatag tctgacacta 11100 
ataggtagaa atttaaccaa aggagattat gaagctctgc acttgagtta acaaaatcac 11160 
ttctcagctt ccagttccat ctcagaagga aggaaaaggg attaaaaatc cagagaccag 11220 
aaaatgggag caaagtacaa ggtggtgtaa tcattacaga ggtttcctga tgtttccaag 11280 
tcagtcgtgt gttgagctgc taaactctaa agtaatttta ggtggaatgt tggaaacatg 11340 
ctgctgaggt gatagaaagg aatccatggt cctctgttag ttggaaagta tatggaatac 11400 
tatattctac ataagataca atactctctg tgagacaagg ataaagtaga ttttgtcagt 11460 
gaaattgtga caagaatcgc tgatgggttt agagcctaag tttgcgagga gcactggaag 11520 
aaattaagat tgttgagatt ggaaagggtt agctatgggg gaacaggagg aggtgactcc 11580 
atgacagacc aaatattcaa aggactgtgt agaagaggaa aaagactttg ttagggctcc 11640 
agaggacaga gccaggagtc agacagggcc ttgaactcaa cccaccgaga tctgcaaact 11700 
ttgcaggatg caccagatgt cttgtagcca tgggtcaagg ggggaccctg ggtaagagac 11760 
tgtaatagat gacctctaag gccatctcat gacatgtgtg attaatgtat gtacctgtcc 11820 
tctctttttg acaattctac agattattca ggacagggag ttgaccaact gcaaagagtg 11880 
attgacacca tcaaaaccaa ccctgacgac agaagaatca tcatgtgcgc ttggaatcca 11940 
agaggttgaa agaaccccgt cgtcttcatt tatactaacc atactcttag agggaagcaa 12000 
tctggttttg tgcagaggca ctgagggagg caggaccctg ggcaacttcc cccagccaca 12060 
tggttgtgtg acgttgggca agtcaeattt tgctgcactt tcaccttcag atcatgaggt 12120 
tgggcccaga ggattttttt tttttttttt ttttttgaga cagagttttg ctctgttgcc 12180 
caggctggaa tgcaacggcg tgatcttggc tcactgtaac ctctgcctcc tgggttcgag 12240 
tgattctcct gcctcagcct ccaagtagct gggattacag catgtgccac catgcctggc 12300 
taattttgta tttttagtag agacgggttc acatgttggt caggctggtc ttgactcctg 12360 
accctcagat gatctgcctt gcctcagcct cccaaccgag tgatcttaag ttgtgtatta 12420 
tactcattct tacacaaaaa gggctttaaa tgcctagaaa ctacatgaag. atgttaacat 12480 
tttaaatgga agcagatgaa gttccagctc gctgccacct cactaacatt tttaacaatt 12540 
atattgtaaa attcaactct accagggtgt agagccaggt gtggtggctc acacctgtaa 12600 
ttccaacaac tccagaggcc aaggcgagag gatcatttga acccacggaa tttgaggctg 12660 
tagtgagtca tgatcacgcc attgcactcc atcctgggca acagagtgag accctgaata 12720 
tttaaaaaca acaacaacaa caaaactcta tcaggatatc ataagtactt agagtgaaat 12780 
acttgcatct -gtaatagaga cttatttttt ttttttttga gacacagtct caccctgttg 12840 
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aaggggaLg ScatttL J f^^" Sgagagcagg .^ctcgtct 9840 

aattcttcaa taaaagccta aaatctatat SEiZ?" tttgggcctg ^gatgtcag 9900 
ttagcaactt caggaagttt aaaaatgctg tgtggS Sf^^ Ctgttaa "9 ^0 
gcagaaagtg gaatgctagt ttccaggga? tggggaacaS 9aataggcaa srttcttaaag 10020 
■ tttaatgggc acagaggttt tgttagggat gSCaaS 99a " aatgg Sgaffttcatg 1.0080 
atggagatgg tgatggtgat ggaga^tg JtggtaatS " Cgggagat Sgtgatggtg 10 140 
gatggtgatg gtgatggtga tggl ga ?gg? SJSSaS ^ atggtgat ^gatggt 10200 
gatggtgatg gtgatggaga tggtgatgS SgXaata gt9 \ tggaga tggtgatggt 10260 
gatggtgatg gtgatggtga tggtgatggt gaStSS 9 ' g " ggtga ^tgatgga 10320 
gatggtgatg gttgcctaac atcaggaKg ?gc?taaS ?^! tggaga ^agatggt 10380 
tggcaagttt aatattatgt gtactttatc IrtllT "ctgaattg cacacaaaaa 10440 
ttacttgtgc aggtaatg?t ItTccllTc %?**?* e gtggg ~ aa * 105oS 

aggatgtgag gccagtcccc gggcttL?a ttf^tt ctc agttgta gggtgtccgt 10560 
tttcttgtcg cttaaaaggc SaL^at talggtctt* att^"* 9 tattcaa " a 10620 
ttagctgttg gattttagta ggaaagttcg tcccttttta f^ 3 gtat 9^tgc. 10680 
attcacagga attttttttt ttttttttt? ttttt"ttt £ it ^ tgtttCaca 9 "7.40 
tggactctct acccagtttc cccagtgata atafccS™ ^ aatgcaca 9 aaagtttccc 10800 
cat tggtgca ttcctcagag ttgt ^ a ga J a ttgct^? 5 r t aaC f CCt9 tataca "ca 10860. 
tgtatttgca attttagcac gtgtagactc ttataar^ f acgtgc ? c t tgtgtatgtg 10920. 
acactaccaa ggttcatctt ?t?aaaat£ ttSott^ ta = aatcaa 9 ttacagaact 10980. 
tgagaggact ttcctcccaa aattttoaaa SJ? ""tttgga acagtgacca 11040 
atagg taga a atttaaccaa aggagattat aaa^S^ Cagaatata 9 tctgacacta- 11100 
ttctcagctt ccagttccat cSa5«j£ aggSSa !" tgagtta acaa ^tcac 11160 
aaaatgggag caaagtacaa ggtgftg? 9 * teat** 3999 attaaaaatc cagagaccag 11220 
tcagtcgtgt gttgagctgc taaa^taa Saa"^ 3 ? 3 ggtttcct 9 a tgtttccaag 11280 
ctgctgaggt gatagaaagg aatccatgg? ^ tg 9-acatg 11340 

tatattctac ataagataca atactctctg tgaaS^f " ggaaagCa tatggaatac. 11400 
gaaattgtga caagaatcgc tgatggattt t " ataaa ^^ ttttgtcagt 11460 

aaattaagat tgttgagaL ggaaagggtt " tgcga ^a gcactgga^g 11520 

atgacagacc aaatattcaa aggac?g?gt aSaSf 9 9aaCaggagg aggtgactcc 11580 
agaggacaga gccaggagtc agacagggS SoaSSU aaagaCtttg "agggctcc 11640 
ttgcaggatg caccagatgt cttgtlgcca teS?S£* CCCaccgaga tctgcaaact U,700 
tgtaatagat gacctctaag gccatcK SSEEE gggga ~ctg ggtaagagac 11760 
tctctttttg acaattctac agattattca c2t ZJS? 9 attaat 9 tat gtacctgtcc 11820. 
attgacacca tcaaaaccaa ccctg^cgac agaSSa " gaccaact ^aagagtg 11880 
agaggttgaa agaaccccgt cgtcttcatt tCatgtgc 9 c "ggaatcca 11940 

tctggttttg tgcagaggL c££S£ ataCtCttag ^ggaagcaa 12000 

tggttgtgtg acgttgggca agLS??? t^ccaS 9gCaaCttcc "cagccaca 12060 
tgggcccaga ggattttttt tttttttttt SStaa^ tCaCCttca 9 atcatgaggt 12120 
caggctggaa tgcaacggcg tgatcttggc tcacSaac ctct ^tgcc 12180 

tgattctcct gcctcagcct ccaagtagct oi l? ctctgcctcc tgggttcgag 12240 
taattttgta tttttagtag agacggg?tc Statta" 9 CatgtgCCac ^atgcctggc 12300 
accctcagat gatctgcctt gcctcagcct ^ 3gctggtc ttgactcctg 12360 

tactcattct tacacaaaaa gggctttaaa tl^t* 9 9 tgatcttaa g ttgtgtatta 12420 
tttaaatgga agcagatgaa Sagctc Sgccfcct l^*** a tgttaacat 12480 
atattgtaaa attcaactct accaooat^ gccacct "ctaacatt tttaacaatt 12540 
ttccaacaac tccagaggcc 11^%% ^StSS 9 c 9gt9gCtC a «cc t gtaa l^o 
tagtgagtca tgatcacgcc att^cactcc ItcctagS IcT^f 9 ** tttgaggct 9 12660 
tttaaaaaca acaacaacaa caaaactcta Saa? 9 ^ a f gagtgag accctgaata 12720 
acttgcatct gtaatagaga cttatttttt ttStttaa agagtgaaa t 12780 

<-c ctttttttga gacacagtct caccctgttg 12840 
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cccaggctgg agtgcagtgg tttgatctcc gctcacggca acctccatct cccaggttca 12900 
agtgagttcc cattcctcag ccccagagct gggaccacag gcgcgcgaat ttttgtattt 12960 
ttagcagaga cggggtttca ctatgttggc caggctagtc tcaaactcaa gttggcctca 13020 
agtgatctgc ccaccctggc gtcccagtgt tgggatttca ggcatgagcc actgtgcctg 13080 
gccatgtaat agagactttt aatataggag ggtgtaccag aagcaccagt ttcctgtggc 13140 
aaacagaatt attcctgctg tatttgtaat ttggtgccac gaggtagccc agatcccttc 13200 
agctctgatg gaagagcatt gcttcagccg taaatggaca cctgcagaaa ccttgcaccg 13260 
atggatagtc tccctcagct ccgtgccatc gctgcagggg. ctgttatgga catcactgca 13320 
gcccagtggc tctctctcct ggtctccacc atatgagttg gcttctgttt ctctcctgtt 13380 
ttactttgcc tttagctgtg gtctttcaaa ccaccatccc tccttatctt cctctgctgg .13440 
ttcctcagat cttcctctga tggcgctgcc tccatgccat gccctctgcc agttctatgt 13500 
ggtgaacagt gagctgtcct gccagctgta ccagagatcg ggagacatgg gcctcggtgt 13560 
gcctttcaac atcgccagct acgccctgct cacgtacatg attgcgcaca tcacgggcct 13620 
gaaggtgggc tgtctcggga agggtgactt gccagcctac cacatgagct cttcagttct 13 680 
ttaatatggg. aaaacaaatt gcagagttta gtctctgatt agcttttaaa tttgatatgt 13740 
gcaagtaaga catgaaccag cttttacttt gaaaccttcc ttttctggaa ggttttctgg 13 800 
ccctgtggta tatgcactaa cagatctata caggttgttt gtgatacagc ttctatggat 13860 
cttctcaaaa gctatgctga ggttgggtat ggtggctcat gcctgtaatc ccagcacttt 13920 
ggaagactga gacaggagca attgcttgag gtctggagtt caataccagc ctgggcaaca 13980 
taacaagatg ctgttgctac aaaaaaatgg aaaagctaca ctaaattatt tttttaaaaa 14040 
aagccttgcg gtgtctgcat attctaatgt ttttaaatga tgttttaaag aattgaaact 14100 
aacatactgt tctgctttct cccggtttat agccaggtga ctttatacac actttgggag 14160 
atgcacatat ttacctgaat cacatcgagc cactgaaaat tcaggtaaga attagatgtt 14220 
atacttttgg gtttggtacc ttctcttgat aaaaggttga ctgtggaaca ggtatctgct 14280 
caatgctgtg tccaagataa agatgactgc tccaaatgtg gggcttcagt ttagggagaa 14340 
gtggtgggca ggtgggcagg acaaggcagg catctgcctc agcaaccatg gcacttaact 14400 
tgtcaggtgc tgtgaggtac taagcaccag taccagagag ggaagagcca cattcaagcc 14460 
aggggattgt ccaaaaggag gcattttaac tcattttaac ttgaaggaga attgaagtgc 14520 
aaatgttttt ccttttcttt ttttttgaga tggagtcttt ctctgt'cggc caggctggag 14580 
tgtgccgtgg tgcgatctca gctcactgca acctccacct cccgggttca agcaattctt 14640 
ctgcctcagc ctcccaggta gctgggatta caggcacatg ccaccacacc cagctaattt 14700 
tttgtattat tagtagagat ggggtttcgt catgttggcc aggctgatct caaactcctg 14760 
acttcaagtg taccacctgc ctcagcctcc gaaagttctg gaattacagg cataagccac 14820 
caccctggcc ataaatattt tttgttaatt ttacattaag tacaatattt aggtccaaac 14880 
ttcaaaagtc tgttgaaatc cctgaagtta tagcagccaa caattgatat gaaatggcaa 14940 
taaaaatgta agttcatctg cttcatgagc cttaaggaaa aaaactcaga accagacact 15000 
ttttagcccc ttccaggtta gatccaggtt ttaaaagtta ttcctttgag ggagtttggc 15060 
tgcttttgag tggaggtgac ttcaggctta ttctctctgg ctctctgctc tggtcatttt 15120 
tagacatagt aataggttgt gacctgtctt cacatcctaa ttgccactgt ctgttcatcc 15180 
caggaatcct ggctttcatc cctttctgtt cactgtccat gcatgtcatc tttccttctt 15240 
tctgccaggg accagatggg ttagggattg tgaattcaag taaacgtaga gctactatga 15300 
gttacagatt gactgtgttc ctgtctttaa taaatttgcc aagagtggtt ataagaactt 15360 
acacctgatg aggcaccagg ctcctgatgc tgtgtaatgt cacaaaatac ccctcactct 15420 
cgatctgtgc aagagaacag ctggttgcgc tccaatcatg ttacataacc tacgcgaagg 15480 
tatcgacagg atcatactcc tgtaaaatag aactttgttg atcacatcct gtgtacttgt 15540 
ttcacggaca tgaggagcaa ttacaacagg tcgtacaatt atggcaaaat aatggcctta 15600 
ttttgttttt agcttcagcg agaacccaga cctttcccaa agctcaggat tcttcgaaaa 15660 
gttgagaaaa ttgatgactt caaagctgaa gactttcaga ttgaagggta caatccgcat 15720 
ccaactatta aaatggaaat ggctgtttag ggtgctttca aaggagctcg aaggatattg 15780 
tcagtcttta ggggttgggc tggatgccga ggtaaaagtt ctttttgctc taaaagaaaa 15840 
aggaactagg tcaaaaatct gtccgtgacc tatcagttat taatttttaa ggatgttgcc 15900 
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cccaggctgg agtgcagtgg tttgatctcc gctcacggca acctccatct cccaggttca 12900 
agtgagttcc cattcctcag ccccagagct gggaccacag gcgcgcgaat ttttgtattt 12960 
ttagcagaga cggggtttca ctatgttggc caggctagtc tcaaactcaa gttggcctca 13020 
agtgatctgc ccaccctggc gtcccagtgt tgggatttca ggcatgagcc actgtgcctg 13080 
gccatgtaat agagactttt aatataggag ggtgtaccag aagcaccagt ttcctgtggc 13140 
aaacagaatt attcctgctg tatttgtaat ttggtgccac gaggtagccc agatcccttc 13200 
agctctgatg gaagagcatt gcttcagccg taaatggaca cctgcagaaa ccttgcaccg 13260 
atggatagtc tccctcagct ccgtgccatc gctgcagggg ctgttatgga catcactgca 13320 
gcccagtggc tctctctcct ggtctccacc atatgagttg gcttctgttt ctctcctgtt 13380 
ttactttgcc tttagctgtg gtctttcaaa ccaccatccc tccttatctt cctctgctgg 13440 
ttcctcagat cttcctctga tggcgctgcc tccatgccat gccctctgcc agttctatgt 13500 
ggtgaacagt gagctgtcct gccagctgta ccagagatcg ggagacatgg gcctcggtgt 13560 
gcctttcaac atcgccagct acgccctgct cacgtacatg attgcgcaca tcacgggcct 13620 
gaaggtgggc tgtctcggga agggtgactt gccagcctac cacatgagct cttcagttct 13 680 
ttaatatggg aaaacaaatt gcagagttta gtctctgatt agcttttaaa tttgatatgt 13740 
gtaagtaaga catgaaccag cttttacttt gaaaccttcc ttttctggaa ggttttctgg 13800 
ccctgtggta tatgcactaa cagatctata caggttgttt gtgatacagc ttctatggat 13860 
cttctcaaaa gctatgctga ggttgggtat ggtggctcat gcctgtaatc ccagcacttt 1392 0 
ggaagactga gacaggagca attgcttgag gtctggagtt caataccagc ctgggcaaca 13980 
taacaagatg ctgttgctac aaaaaaatgg aaaagctaca ctaaattatt tttttaaaaa 14040 
aagccttgcg gtgtctgcat attctaatgt ttttaaatga tgttttaaag aattgaaact 14100 
aacatactgt tctgctttct cccggtttat agccaggtga ctttatacac actttgggag 14160 
atgcacatat ttacctgaat cacatcgagc cactgaaaat tcaggtaaga attagatgtt 14220 
atacttttgg gtttggtacc ttctcttgat aaaaggttga ctgtggaaca ggtatctgct 14280 
caatgctgtg tccaagataa agatgactgc tccaaatgtg gggcttcagt ttagggagaa 14340 
gtggtgggca ggtgggcagg acaaggcagg catctgcctc agcaaccatg gcacttaact 14400 
tgtcaggtgc tgtgaggtac taagcaccag taccagagag ggaagagcca cattcaagcc 14460 
aggggattgt ccaaaaggag gcattttaac tcattttaac ttgaaggaga attgaagtgc 14520 
aaatgttttt ccttttcttt ttttttgaga tggagtcttt ctctgtcggc caggctggag 14580 
tgtgccgtgg tgcgatctca gctcactgca acctccacct cccgggttca agcaattctt 14640 
ctgcctcagc ctcccaggta gctgggatta caggcacatg ccaccacacc cagctaattt 14700 
tttgtattat tagtagagat ggggtttcgt catgttggcc aggctgatct caaactcctg 14760 
acttcaagtg taccacctgc ctcagcctcc gaaagttctg gaattacagg cataagccac 14820 
caccctggcc ataaatattt tttgttaatt ttacattaag tacaatattt aggtccaaac 14880 
ttcaaaagtc tgttgaaatc cctgaagtta tagcagccaa caattgatat gaaatggcaa 1494 0 
taaaaatgta agttcatctg cttcatgagc cttaaggaaa aaaactcaga accagacact 15000 
ttttagcccc ttccaggtta gatccaggtt ttaaaagtta ttcctttgag ggagtttggc 15060 
tgcttttgag tggaggtgac ttcaggctta ttctctctgg ctctctgctc tggtcatttt 15120 
tagacatagt aataggttgt gacctgtctt cacatcctaa ttgccactgt ctgttcatcc 15180 
caggaatcct ggctttcatc cctttctgtt cactgtccat gcatgtcatc tttccttctt 15240 
tctgccaggg accagatggg ttagggattg tgaattcaag taaacgtaga gctactatga 15300 
gttacagatt gactgtgttc ctgtctttaa taaatttgcc aagagtggtt ataagaactt 15360 
acacctgatg aggcaccagg ctcctgatgc tgtgtaatgt cacaaaatac ccctcactct 15420 
cgatctgtgc aagagaacag ctggttgcgc tccaatcatg ttacataacc tacgcgaagg 15480 
tatcgacagg atcatactcc tgtaaaatag aactttgttg atcacatcct gtgtacttgt 15540 
ttcacggaca tgaggagcaa ttacaacagg tcgtacaatt atggcaaaat aatggcctta 15600 
ttttgttttt agcttcagcg agaacccaga cctttcccaa agctcaggat tcttcgaaaa 15660 
gttgagaaaa ttgatgactt caaagctgaa gactttcaga ttgaagggta caatccgcat 15720 
ccaactatta aaatggaaat ggctgtttag ggtgctttca aaggagctcg aaggatattg 15780 
tcagtcttta ggggttgggc tggatgccga ggtaaaagtt ctttttgctc taaaagaaaa 15840 
aggaactagg tcaaaaatct gtccgtgacc tatcagttat taatttttaa ggatgttgcc 15900 
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actggcaaat 

agggtatctg 

tagcaaaaac 

cacaagctat 

aatctgaggg 

agttatagtt 

gttcattctg 

ccttaagtaa 

aataatagtg 

tataatgacc 

gggttatttt 

gacttctcta 

aatcagtgaa 

ttttattctg 

tgcaacatgt 

gctatttttg 

gattaagtag 

attcctgagg 

ccatttttgt 

ttgttggggc 

tctggagact 

tatcaaagac 

gcatttatgc 

tttttttttg 

ggctcactgc 

agctgggatc 

cagggtttca 

cctcagcctc 

atatttttaa 

tggctacttg 

ggtgaaaatg 

gtagtccaaa 

agtgcaaatg 

aggcaattga 

aagaaccgtt 

tctagttcac 

ggcacagctt 

atgctcatgc 

aatgatgcct 

cactcaattg 

cctaaacctc 

ttgatgacat 

caattcagac 

atagattaaa 

atggctgcag 



gtaactgtgc 

acaatgctga 

atgtatgtgc 

ttttggaata 

agctgagtaa 

gttttatatg 

tactgccact 

acatgtgctg 

aatatatttt 

atttaggata 

aaaataatct 

ataatatgct 

atatttttcc 

ctaatttatg 

tgcttatttt 

taagaacatc 

gataacgtgc 

gtggtatgac 

aaaactattt 

tgagcactta 

ggtgtttctt 

caaaaaaatg 

tgctccaaga 

acatggagtc 

aacctccacc 

acaggcacct 

ccatgttggc 

ccaaagtgtt 

aataagtttg 

ccagagttag 

agccatgatc 

ctttttaggt 

ttaagatctc 

acctccaagc 

tgttagtgga 

gagacaggcg 

accttgggag 

aggtggtcaa 

gaagtcagat 

acagtgttgg 

tgtgcctcag 

gaacacacga 

tctgtaaata 

atcaccacat 

cagctgccag 



cagttctttc 

ggttatgaac 

atttcaatcc 

tttttagaat 

caccatcgat 

ttgctataat 

tatctgctca 

tattctggtt 

gccctatttt 

gagttttttt 

atggactacc 

tagattaagc 

cttcgttgca 

acaagtgtta 

caaattacag 

ctcctggact 

attgatttgc 

ttctagctga 

ccaagaaatt 

attttcttga 

tacagattcc 

agtcctgtta 

aatgcttrtta 

actgtccgtt 

tcccaggttc 

gccaccatgc 

caggctggtc 

gggattacag 

gagactgttg 

ttaactcatc 

tcttgaacat 

cactaacttg 

cactagtgag 

cctggccctc 

tattgcaaac 

tggcagtggc 

gcatgtagga 

catactcaca 

tattcacgtg 

ccaaggctta 

gtccttttct 

agtccctctg 

tccagaatta 

tttgtgatct 

gggcttgggg 



cataataaaa 

aaagtgagga 

cacgtactta 

attttaagaa 

catgatgtag 

aaagaagtgt 

gttccttcct 

tggatgctac 

tctcatttta 

tttttttttt 

attttgccct 

aaggaaaaga 

taccagatac 

aacagaacaa 

tttaatgtct 

ttgggttagt 

taaaagaatc 

actcatcttg 

ttaagccctt 

gcaggaagga 

tccttcattt 

acaaccacct 

cgtctaagcc 

gcccaggctg 

aagtgattct 

ccggctaatt 

tcaaacacct 

gcgtaagcca 

gaaataatag. 

aaactctttg 

gatcagaata 

ctagatggtg 

gaaaggctag 

aggcctgagg 

aaacaggagt 

tggcagcatc 

agcccgctgg 

caccctagga 

gtgctaactt 

gctgtgttac 

tctaaaatat 

agaggttttg 

tttgggttcc 

atttttcaag 

tttttttggc 



ggctttgagt 
gaatgaaatg 
taaagaaggt 
tttcacaagc 
agtgtggtta 
tctgcattcg 
aaaatagatt 
ttaaaagagt 
actgcatctt 
taaactttta 
cattagcttc 
tgcaaaacca 
ccccggtgtt 
ggaattattc 
aggtgccagc 
taaatctaaa 
aagtaataat 
atcggtagga 
tcacttcaga 
gtttcttcca 
ctgttgagta 
ggaacaaaaa 
agaggcaatt 
cagtgcagtg 
cctgcctcag 
ttttgtattt 
gacctcaaat 
ccatgcccag 
ggcagaggaa 
ataatagttt 
aatgccccag 
ccaggttttt 
tattacagaa 
attttgatac 
caaagcttgg 
tcttctcaca 
atcatcacgg 
ggagggaatc 
aaagcagaag 
catgcgtttc 
agcaatgtga 
tggtgccctt 
tctggtcaaa 
aagcgtttgt 
aggtagggtt 



taactcactg 
tatgtgctct 
tggtgaattt 
tattccctca 
tgaactttaa 
tccacgcttt 
aaagaactct 
atattttaga 
atcctcaaaa 
taaccttaaa 
agcatggtgt 
cttcggggtt 
gcacgactat 
caacaagtta 
ccttgatata 
cttatttaag 
tacttagctg 
ttttttaaat 
aagaaaaaag 
aacttcacca 
gccgggatcc 
cagattttat 
aattaatttt 
gcgcaatctt 
cctcccatgt 
tttgtagaga 
gatccacctg 
ccctgaatta 
catattttac 
gacctctgtt 
ccacacaatt 
ttgcacaagg 
gccttgtcag 
agacaaactg 
tgctccacag 

ggggccctca 

gatacttgaa 
agatcggggc 
gagcgagtac 
taggcaagtc 
ggtggggact 
taaaagggat 
agtcagatga 
attttttcat 
gggagg 



15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860. 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060' 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18596 
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actggcaaat gtaactgtgc cagttctttc cataataaaa ggctttgagt taactcactg 15960 
agggtatctg acaatgctga ggttatgaac aaagtgagga gaatgaaatg tatgtgctct 16020 
tagcaaaaac atgtatgtgc atttcaatcc cacgtactta taaagaaggt tggtgaattt 16080 
cacaagctat ttttggaata tttttagaat attttaagaa tttcacaagc tattccctca 16140 
aatctgaggg agctgagtaa caccatcgat catgatgtag agtgtggtta tgaactttaa 16200 
agttatagtt gttttatatg ttgctataat aaagaagtgt tctgcattcg tccacgcttt 16260 
gttcattctg tactgccact tatctgctca gttccttcct aaaatagatt aaagaactct 16320 
ccttaagtaa acatgtgctg tattctggtt tggatgctac ttaaaagagt atattttaga 16380 
aataatagtg aatatatttt gccctatttt tctcatttta actgcatctt atcctcaaaa 16440 
tataatgacc atttaggata gagttttttt tttttttttt taaactttta taaccttaaa 16500 
gggttatttt aaaataatct atggactacc attttgccct cattagcttc agcatggtgt 16560 
gacttctcta ataatatgct tagattaagc aaggaaaaga tgcaaaacca cttcggggtt 16620 
aatcagtgaa atatttttcc cttcgttgca taccagatac ccccggtgtt gcacgactat 16680 
ttttattctg ctaatttatg acaagtgtta aacagaacaa ggaattattc caacaagtta 16740 
tgcaacatgt tgcttatttt caaattacag tttaatgtct aggtgccagc ccttgatata 16800 
gctatttttg taagaacatc ctcctggact ttgggttagt taaatctaaa cttatttaag, 16860 
gattaagtag gataacgtgc attgatttgc taaaagaatc aagtaataat tacttagctg 16920 
attcctgagg gtggtatgac ttctagctga actcatcttg atcggtagga ttttttaaat 16980 
ccatttttgt aaaactattt ccaagaaatt ttaagccctt tcacttcaga aagaaaaaag 17040 
ttgttggggc tgagcactta attttcttga gcaggaagga gtttcttcca aacttcacca 17100 
tctggagact ggtgtttctt tacagattcc tccttcattt ctgttgagta gccgggatcc 17160 
tatcaaagac caaaaaaatg agtcctgtta acaaccacct ggaacaaaaa cagattttat 17220 
gcatttatgc tgctccaaga aatgctttta cgtctaagcc agaggcaatt aattaatttt 17280 
tttttttttg acatggagtc actgtccgtt gcccaggctg cagtgcagtg gcgcaatctt 17340 
ggctcactgc aacctccacc tcccaggttc aagtgattct cctgcctcag cctcccatgt 17400 
agctgggatc acaggcacct gccaccatgc ccggctaatt ttttgtattt tttgtagaga 17460 
cagggtttca ccatgttggc caggctggtc tcaaacacct gacctcaaat gatccacctg 17520 
cctcagcctc ccaaagtgtt gggattacag gcgtaagcca ccatgcccag ccctgaatta 17580 
atatttttaa aataagtttg gagactgttg gaaataatag ggcagaggaa catattttac 17640 
tggctacttg ccagagttag ttaactcatc aaactctttg ataatagttt gacctctgtt 17700 
ggtgaaaatg agccatgatc tcttgaacat gatcagaata aatgccccag ccacacaatt 17760 
gtagtccaaa ctttttaggt cactaacttg ctagatggtg ccaggttttt ttgcacaagg 17 820 
agtgcaaatg ttaagatctc cactagtgag gaaaggctag tattacagaa gccttgtcag 17880 
aggcaattga acctccaagc cctggccctc aggcctgagg attttgatac agacaaactg 17940 
aagaaccgtt tgttagtgga tattgcaaac aaacaggagt caaagcttgg tgctccacag 18000 
tctagttcac gagacaggcg tggcagtggc tggcagcatc tcttctcaca ggggccctca 18060 
ggcacagctt accttgggag gcatgtagga agcccgctgg atcatcacgg gatacttgaa 18120 
atgctcatgc aggtggtcaa catactcaca caccctagga ggagggaatc agatcggggc 18180 
aatgatgcct gaagtcagat tattcacgtg gtgctaactt aaagcagaag gagcgagtac 18240 
cactcaattg acagtgttgg ccaaggctta gctgtgttac catgcgtttc taggcaagtc 18300 
cctaaacctc tgtgcctcag gtccttttct tctaaaatat agcaatgtga ggtggggact 18360 
ttgatgacat gaacacacga agtccctctg agaggttttg tggtgccctt taaaagggat 18420 
caattcagac tctgtaaata tccagaatta tttgggttcc tctggtcaaa agtcagatga 18480 
atagattaaa atcaccacat tttgtgatct atttttcaag aagcgtttgt attttttcat 18540 
atggctgcag cagctgccag gggcttgggg tttttttggc aggtagggtt gggagg 18596 
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<400> 12 

■ accgggcaag cgggaaccag gtggccaccc ggtgtcggtt tcattttcct ttggaatttc 60 
tgctttacag acagaacaat ggcagcccga gtacttataa ttggcagtgg aggaagggaa 120 
catacgctgg cctggaaact tgcacagtct catcatgtca aacaagtgtt ggttgcccca 180 
ggaaacgcag gcactgcctg ctctgaaaag atttcaaata ccgccatctc aatcagtgac 240 
cacactgccc ttgctcaatt ctgcaaagag aagaaaattg aatttgtagt tgttggacca 300 
gaagcacctc tggctgctgg gattgttggg aacctgaggt ctgcaggagt gcaatgcttt 360 
ggcccaacag cagaagcggc tcagttagag tccagcaaaa ggtttgccaa agagtttatg 420 
gacagacatg gaatcccaac cgcacaatgg aaggctttca ccaaacctga agaagcctgc 480 ' 
agcttcattt tgagtgcaga cttccctgct ttggttgtga aggccagtgg tcttgcagct 540 
ggaaaagggg tgattgttgc aaagagcaaa gaagaggcct gcaaagctgt acaagagatc 600 . 
atgcaggaga aagcctttgg ggcagctgga gaaacaattg tcattgaaga acttcttgac 660 
. ggagaagagg tgtcgtgtct gtgtttcact gatggcaaga ctgtggcccc catgccccca 720 
gcacaggacc ataagcgatt actggaggga gatggtggcc ctaacacagg gggaatggga 780 
gcctattgtc cagcccctca ggtttctaat gatctattac taaaaattaa agatactgtt 840 
. cttcagagga cagtggatgg catgcagcaa gagggtactc catatacagg tattctctat 900 
gctggaataa tgctgaccaa gaatggccca aaagttctag agtttaattg ccgttttggt 960 
gatccagagt gccaagtaat cctcccactt cttaaaagtg atctttatga agtgattcag 1020 
tccaccttag atggactgct ctgcacatct ctgcctgttt ggctagaaaa ccacaccgcc 1080 
ctaactgttg tcatggcaag taaaggttat cctggagact acaccaaggg tgtagagata 1140 , 
acagggtttc ctgaggctca agctctagga ctggaggtgt tccatgcagg cactgccctc 1200 
aaaaatggca aagtagtaac tcatgggggt agagttcttg cagtcacagc catccgggaa 1260 
aatctcatat cagcccttga ggaagccaag aaaggactag ctgctataaa gtttgaggga 1320 
gcaatttata ggaaagacgt cggctttcgt gccatagctt tcctccagca gcccaggagt 1380 
ttgacttaca aggaatctgg agtagatatc gcagctggaa atatgctggt caagaaaatt 1440 
cagcctttag caaaagccac ttccagatca ggctgtaaag ttgatcttgg aggttttgct 1500 
ggtctttttg atttaaaagc agctggtttc aaagatcccc ttctggcctc tggaacagat 1560 
ggcgttggaa ctaaactaaa gattgcccag ctatgcaata aacatgatac cattggtcaa 1620 
gatttggtag caatgtgtgt taatgatatt ctggcacaag gagcagagcc cctcttcttc 1680 
cttgattact tttcctgtgg aaaacttgac ctcagtgtaa ctgaagctgc tgttgctgga 1740 
attgctaaag cttgtggaaa agctggatgt gctctccttg gaggtgaaac agcagaaatg 1800 
cctgacatgt atccccctgg agagtatgac ctagctgggt ttgccgttgg tgccatggag 1860 
cgagatcaga aactccctca cctggaaaga atcactgagg gtgatgttgt tgttggaata 1920 
gcttcatctg gtcttcatag .caatggattt agccttgtga ggaaaatcgt tgcaaaatct 1980 
tccctccagt actcctctcc agcacctgat ggttgtggtg accagacttt aggggactta 2040 
cttctcacgc ctaccagaat ctacagccat tcactgttac ctgtcctacg ttcaggacat 2100 
gtcaaagcct ttgcccatat tactggtgga ggattactag agaacatccc cagagtcctc 2160 
cctgagaaac ttggggtaga tttagatgcc cagacctgga ggatccccag ggttttctca 2220 
tggttgcagc aggaaggaca cctctctgag gaagagatgg ccagaacatt taactgtggg 2280 
gttggcgctg tccttgtggt atcaaaggag cagacagagc agattctgag ggatatccag 2340 
cagcacaagg aagaagcctg ggtgattggc agtgtggttg cacgagctga aggttcccca 2400 
cgtgtgaaag tcaagaatct gattgaaagc atgcaaataa atgggtcagt gttgaagaat 2460 
ggctccctga caaatcattt ctcttttgaa aaaaaaaagg ccagagtggc tgtcttaata 2520 
tctggaacag gatcgaacct gcaagcactt atagacagta ctcgggaacc aaatagctct 2580 
gcacaaattg atattgttat ctccaacaaa gccgcagtag ctgggttaga taaagcggaa 2640 
agagctggta ttcccactag agtaattaat cataaactgt ataaaaatcg tgtagaattt 2700 
gacagtgcaa ttgacctagt ccttgaagag ttctccatag acatagtctg tcttgcagga 2760 
ttcatgagaa ttctttctgg cccctttgtc caaaagtgga atggaaaaat gctcaatatc 2820 
cacccatcct tgctcccttc ttttaagggt tcaaatgccc atgagcaagc cctggaaacc 2880 
ggagtcacag ttactgggtg cactgtacac tttgtagctg aagatgtgga tgctggacag 2940 
attattttgc aagaagctgt tcccgtgaag aggggtgata ctgtcgcaac tctttctgaat 3000 
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<400> 12 

accgggcaag cgggaaccag gtggccaccc ggtgtcggtt tcattttcct ttggaatttc 60 
tgctttacag acagaacaat ggcagcccga gtacttataa ttggcagtgg aggaagggaa 120 
catacgctgg cctggaaact tgcacagtct catcatgtca aacaagtgtt ggttgcccca 180 
ggaaacgcag gcactgcctg. ctctgaaaag atttcaaata ccgccatctc aatcagtgac 240 
cacactgccc ttgctcaatt ctgcaaagag aagaaaattg aatttgtagt tgttggacca 300 
gaagcacctc tggctgctgg gattgttggg aacctgaggt ctgcaggagt gcaatgcttt 360 
ggcccaacag cagaagcggc tcagttagag tccagcaaaa ggtttgccaa agagtttatg 420 
gacagacatg gaatcccaac cgcacaatgg aaggctttca ccaaacctga agaagcctgc 480 
agcttcattt tgagtgcaga cttccctgct ttggttgtga aggccagtgg tcttgcagct 540 
ggaaaagggg tgattgttgc aaagagcaaa gaagaggcct gcaaagctgt acaagagatc 600 
atgcaggaga aagcctttgg ggcagctgga gaaacaattg tcattgaaga acttcttgac 660 , 
ggagaagagg tgtcgtgtct gtgtttcact gatggcaaga ctgtggcccc catgccccca 720 
gcacaggacc ataagcgatt actggaggga gatggtggcc ctaacacagg gggaatggga 780 
gcctattgtc cagcccctca ggtttctaat gatctattac taaaaattaa agatactgtt 840 
cttcagagga cagtggatgg catgcagcaa gagggtactc catatacagg tattctctat 900 
gctggaataa tgctgaccaa gaatggccca aaagttctag agtttaattg ccgttttggt 960 
gatccagagt gccaagtaat cctcccactt cttaaaagtg atctttatga agtgactcag 1020 
tccaccttag atggactgct ctgcacatct ctgcctgttt ggctagaaaa ccacaccgcc 1080 
ctaactgttg tcatggcaag taaaggttat cctggagact acaccaaggg tgtagagata 1140 
acagggtttc ctgaggctca agctctagga ctggaggtgt tccatgcagg cactgccctc 1200 
aaaaatggca aagtagtaac tcatgggggt agagttcttg cagtcacagc catccgggaa 1260 
aatctcatat cagcccttga ggaagccaag aaaggactag ctgctataaa gtttgaggga 1320 
gcaatttata ggaaagacgt cggctttcgt gccatagctt tcctccagca gcccaggagt 1380 
ttgacttaca aggaatctgg agtagatatc gcagctggaa atatgctggt caagaaaatt 1440 
cagcctttag caaaagccac ttccagatca ggctgtaaag ttgatcttgg aggttttgct 1500 
ggtctttttg atttaaaagc agctggtttc aaagatcccc ttctggcctc tggaacagat 1560 
ggcgttggaa ctaaactaaa gattgcccag ctatgcaata aacatgatac cattggtcaa 1620 
gatttggtag caatgtgtgt taatgatatt ctggcacaag gagcagagcc cctcttcttc 1680 
cttgattact tttcctgtgg aaaacttgac ctcagtgtaa ctgaagctgt tgttgctgga 1740 
attgctaaag cttgtggaaa agctggatgt gctctccttg gaggtgaaac agcagaaatg 1800 
cctgacatgt atccccctgg agagtatgac ctagctgggt ttgccgttgg tgccatggag 1860 
cgagatcaga aactccctca cctggaaaga atcactgagg gtgatgttgt tgttggaata 1920 
gcttcatctg gtcttcatag caatggattt agccttgtga ggaaaatcgt tgcaaaatct 1980 
tccctccagt actcctctcc agcacctgat ggttgtggtg accagacttt aggggactta 2040 
cttctcacgc ctaccagaat ctacagccat tcactgttac ctgtcctacg ttcaggacat 2100 
gtcaaagcct ttgcccatat tactggtgga ggattactag agaacatccc cagagtcctc 2160 
cctgagaaac ttggggtaga tttagatgcc cagacctgga ggatccccag ggttttctca 2220 
tggttgcagc aggaaggaca cctctctgag gaagagatgg ccagaacatt taactgtggg 2280 
gttggcgctg tccttgtggt atcaaaggag cagacagagc agattctgag ggatatccag 2340 
cagcacaagg aagaagcctg ggtgattggc agtgtggttg cacgagctga aggttcccca 2400 
cgtgtgaaag tcaagaatct gattgaaagc atgcaaataa atgggtcagt gttgaagaat 2460 
ggctccctga caaatcattt ctcttttgaa aaaaaaaagg ccagagtggc tgtcttaata 2520 
tctggaacag gatcgaacct gcaagcactt atagacagta ctcgggaacc aaatagctct 2580 
gcacaaattg atattgttat ctccaacaaa gccgcagtag ctgggttaga taaagcggaa 2640 
agagctggta ttcccactag agtaattaat cataaactgt ataaaaatcg tgtagaattt 2700 
gacagtgcaa ttgacctagt ccttgaagag ttctccatag acatagtctg tcttgcagga 2760 
ttcatgagaa ttctttctgg cccctttgtc caaaagtgga atggaaaaat gctcaatatc 2820 
cacccatcct tgctcccttc ttttaagggt tcaaatgccc atgagcaagc cctggaaacc 2880 
ggagtcacag ttactgggtg cactgtacac tttgtagctg aagatgtgga tgctggacag 2940 
attattttgc aagaagctgt tcccgtgaag aggggtgata ctgtcgcaac tctttctgaal 3000 
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agagtaaaat tagcagaaca taaaatattt cctgcagccc ttcagctggt ggccagtgga 3060 
actgtacagc ttggagaaaa tggcaagatc tgttgggtta aagaggaatg aagcctttta 3120 
attcagaaat ggggccagtt tagaaagaat tatttgctgt ttgcatggtg gttttttatc 3180 
atggacttgg cccaaaagaa aaactgctaa aagacaaaaa agacctcacc cttacttcat 3240 
ctattttttt aataaataga gactcactaa aaaaaaaaaa aaaaaaaaaa a 3291 

<210> 13 
<211> 1776 
<212> DNA 

<213> Homo sapiens 
<400> 13 

atggtgccct ccagcccagc ggtggagaag caggtgcccg tggaacctgg gcctgacccc 60 
gagctccggt cctggcggcg cctcgtgtgc tacctttgct tctacggctt catggcgcag 120 
atacggccag gggagagctt catcaccccc tacctcctgg ggcccgacaa gaacttcacg 180 
cgggacgagg tcacgaacga gatcacgccg gtgctgtcgt actcctacct ggccgtgctg 240 
gtgcccgtgt tcctgctcac cgactacctg cgctacacgc cggtgctgct gctgcagggg 300 
ctcagcttcg tgtcggtgtg gctgctgctg ctgctgggcc actcggtggc gcacatgcag 360 
ctcatggagc tcttctac'ag cgtcaccatg gccgcgcgca tcgcctattc ctcctacatc 420 
ttctctctcg tgcggcccgc gcgctaccag cgtgtggccg gctactcgcg cgctgcggtg 480 
ctgctgggcg tgttcaccag ctccgtgctg ggccagctgc tggtcactgt gggccgagtc; 540 
tccttctcca cgctcaacta catctcgctg gccttcctca ccttcagcgt ggtcctcgcc 600 
ctcttcctga agcgccccaa gcgcagcctc ttcttcaacc gcgacgaccg ggggcggtgc 660 . 
gaaacctcgg cttcggagct ggagcgcatg aatcctggcc caggcgggaa gctgggacac 720 
gccctgcggg tggcctgtgg ggactcagtg ctggcgcgga tgctgcggga gctgggggac 780 
agcctgcggc ggccgcagct gcgcctgtgg tccctctggt gggtcttcaa ctcggccggc 840 
tactacctgg tggtctacta cgtgcacatc ctgtggaacg aggtggaccc caccaccaac 900 
agtgcgcggg tctacaacgg cgcggcagat gctgcctcca cgctgctggg cgccatcacg 960 
tccttcgccg cgggcttcgt gaagatccgc tgggcgcgct ggtccaagct gctcatcgcg 1020 
ggcgtcacgg ccacgcaggc ggggctggtc ttccttctgg cgcacacgcg ccacccgagc 1080 
agcatctggc tgtgctatgc ggccttcgtg ctgttccgcg gctcctacca gttcctcgtg 1140 
cccatcgcca cctttcagat tgcatcttct ctgtctaaag agctctgtgc cctggtcttc 1200 
ggggtcaaca cgttctttgc caccatcgtc aagaccatca tcactttcat tgtctcggac 126.0 
gtgcggggcc tgggcctccc ggtccgcaag cagttccagt tatactccgt gtacttcctg 1320 
atcctgtcca tcatctactt cttgggggcc atgctggatg gcctgcgcga ctgccagcgg 13 80 
ggccaccacc cgcggcagcc cccggcccag ggcctgagga gtgccgcgga ggagaaggca 1440 
gcacagcgac tgagcgtgca ggacaagggc ctcggaggcc tgcagccagc ccagagcccg 1500 
ccgctttccc cagaagacag cctgggggct gtggggccag cctccctgga gcagagacag 1560' 
agcgacccat acctggccca ggccccggcc ccgcaggcag ctgaattcct gagcccagtg 1620 
acaacccctt ccccctgcac tctgtcgtcc gcccaagcct caggccctga ggctgcagat 1680 
gagacttgtc cccagctggc tgtccatcct cctggtgtca gcaagctggg tttgcagtgt 1740 
cttccaagcg acggtgttca gaatgtgaac cagtga 177 6 

<210> 14 
<211> 2500 
<212> DNA 

<213> Homo sapiens 
<400> 14 

tgaatcgccc ggggtcgccg tctccgcctc gccgcagtcg gggcagccgc tgccctcttt 60 
tccatgtatc gtccaggatc ccatgacaga ttctgttgtc acgtctcctt acagagtttg 120 
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agagtaaaat tagcagaaca taaaatattt cctgcagccc ttcagctggt ggccagtgga 3060 
actgtacagc ttggagaaaa tggcaagatc tgttgggtta aagaggaatg aagcctttta 3120 
attcagaaat ggggccagtt tagaaagaat tatttgctgt ttgcatggtg gttttttatc 3180 
atggacttgg cccaaaagaa aaactgctaa aagacaaaaa agacctcacc cttacttcat 3240 
ctattttttt aataaataga gactcactaa aaaaaaaaaa aaaaaaaaaa a 3291 

<210> 13 

<211> 1776 

<212> DNA 

<213> Homo sapiens 

<400> 13 

atggtgccct ccagcccagc ggtggagaag caggtgcccg tggaacctgg gcctgacccc 60 
gagctccggt cctggcggcg cctcgtgtgc tacctttgct tctacggctt catggcgcag 120 
atacggccag gggagagctt catcaccccc tacctcctgg ggcccgacaa gaacttcacg 180 
cgggacgagg tcacgaacga gatcacgccg gtgctgtcgt actcctacct ggccgtgctg 240 
gtgcccgtgt tcctgctcac cgactacctg cgctacacgc cggtgctgct gctgcagggg 300 
ctcagcttcg tgtcggtgtg gctgctgctg ctgctgggcc actcggtggc gcacatgcag 360 
ctcatggagc tcttctacag cgtcaccatg gccgcgcgca tcgcctattc ctcctacatc 420 
ttctctctcg tgcggcccgc gcgctaccag cgtgtggccg gctactcgcg cgctgcggtg 480 
ctgctgggcg tgttcaccag ctccgtgctg ggccagctgc tggtcactgt gggccgagtc 540 
tccttctcca cgctcaacta catctcgctg gccttcctca ccttcagcgt ggtcctcgcc 600 
ctcttcctga agcgccccaa gcgcagcctc ttcttcaacc gcgacgaccg ggggcggtgc 660 
gaaacctcgg cttcggagct ggagcgcatg aatcctggcc caggcgggaa gctgggacac 720 
gccctgcggg tggcctgtgg ggactcagtg ctggcgcgga tgctgcggga gctgggggac 780 
agcctgcggc ggccgcagct gcgcctgtgg tccctctggt gggtcttcaa ctcggccggc 840 
tactacctgg tggtctacta cgtgcacatc ctgtggaacg aggtggaccc caccaccaac 900 
agtgcgcggg tctacaacgg cgcggcagat gctgcctcca cgctgctggg cgccatcacg 960 
tccttcgccg cgggcttcgt gaagatccgc tgggcgcgct ggtccaagct gctcatcgcg 1020 
ggcgtcacgg ccacgcaggc ggggctggtc ttccttctgg cgcacacgcg ccacccgagc 1080 
agcatctggc tgtgctatgc ggccttcgtg ctgttccgcg gctcctacca gttcctcgtg 1140 
cccatcgcca cctttcagat tgcatcttct ctgtctaaag agctctgtgc cctggtcttc 1200 
ggggtcaaca cgttctttgc caccatcgtc aagaccatca tcactttcat tgtctcggac 1260 
gtgcggggcc tgggcctccc ggtccgcaag cagttccagt tatactccgt gtacttcctg 1320 
atcctgtcca tcatctactt cttgggggcc atgctggatg gcctgcgcga ctgccagcgg 1380 
ggccaccacc cgcggcagcc cccggcccag ggcctgagga gtgccgcgga ggagaaggca 1440 
gcacagcgac tgagcgtgca ggacaagggc ctcggaggcc tgcagccagc ccagagcccg 1500 
ccgctttccc cagaagacag cctgggggct gtggggccag cctccctgga gcagagacag 1560 
agcgacccat acctggccca ggccccggcc ccgcaggcag ctgaattcct gagcccagtg 1620 
acaacccctt ccccctgcac tctgtcgtcc gcccaagcct caggccctga ggctgcagat 1680 
gagacttgtc cccagctggc tgtccatcct cctggtgtca gcaagctggg tttgcagtgt 1740 
cttccaagcg acggtgttca gaatgtgaac cagtga 1776 

<210> 14 

<211> 2500 

<212> DNA 

<213> Homo sapiens 

<400> 14 

tgaatcgccc ggggtcgccg tctccgcctc gccgcagtcg gggcagccgc tgccctcttt 60 
tccatgtatc gtccaggatc ccatgacaga ttctgttgtc acgtctcctt acagagtttg 120 
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agcggtgctg aactgtcagc acatctgtcc ggtccagcat gccttctgag accccccagg 180 

cagaagtggg gcccacaggc tgcccccacc gctcagggcc acactcggcg aaggggagcc 240 

tggagaaggg gtccccagag gataaggaag ccaaggagcc cctgtggatc cggcccgatg 300 

ctccgagcag gtgcacctgg cagctgggcc ggcctgcctc cgagtcccca catcaccaca 360 

ctgccccggc aaaatctcca aaaatcttgc cagatattct gaagaaaatc ggggacaccc 420 

ctatggtcag aatcaacaag attgggaaga agttcggcct gaagtgtgag ctcttggcca 480 

agtgtgagtt cttcaacgcg ggcgggagcg tgaaggaccg catcagcctg cggatgattg 540 

aggatgctga gcgcgacggg acgctgaagc ccggggacac gattatcgag ccgacatccg .600 

ggaacaccgg gatcgggctg gccctggctg cggcagtgag gggctatcgc tgcatcatcg 660 

tgatgccaga gaagatgagc tccgagaagg tggacgtgct gcgggcactg ggggctgaga 720 

ttgtgaggac gcccaccaat gccaggttcg actccccgga gtcacacgtg ggggtggcct 780 

ggcggctgaa gaacgaaatc cccaattctc acatcctaga ccagtaccgc aacgccagca 840 

accccctggc tcactacgac accaccgctg atgagatcct gcagcagtgt gatgggaagc 900 

tggacatgct ggtggcttca gtgggcacgg gcggcaccat cacgggcatt gccaggaagc 960 

tgaaggagaa gtgtcctgga tgcaggatca ttggggtgga tcccgaaggg tccatcctcg 1020 

cagagccgga ggagctgaac cagacggagc agacaaccta cgaggtggaa gggatcggct 1080 

acgacttcat ccccacggtg ctggacagga cggtggtgga caagtggttc aagagcaacg 1140 

atgaggaggc gttcaccttt gcccgcatgc tgatcgcgca agaggggctg ctgtgcggtg 1200 

gcagtgctgg cagcacggtg gcggtggccg tgaaggctgc gcaggagctg caggagggcc 1260 

agcgctgcgt ggtcattctg cccgactcag tgcggaacta catgaccaag ttcctgagcg 1320 

acaggtggat gctgcagaag ggctttctga aggaggagga cctcacggag aagaagccct 1380 

ggtggtggca cctccgtgtt caggagctgg gcctgtcagc cccgctgacc gtgctcccga 1440 

ccatcacctg tgggcacacc atcgagatcc tccgggagaa gggcttcgac caggcgcccg 1500 

tggtggatga ggcgggggta atcctgggaa tggtgacgct tgggaacatg ctctcgtccc 1560 

tgcttgccgg gaaggtgcag ccgtcagacc aagttggcaa agtcatctac aagcagttca 1620 

aacagatccg cctcacggac acgctgggca ggctctcgca catcctggag atggaccact 1680 

tcgccctggt ggtgcacgag cagatccagt accacagcac cgggaagtcc agtcagcggc 1740 

agatggtgtt cggggtggtc accgccattg acttgctgaa cttcgtggcc gcccaggagc 1800 

gggaccagaa gtgaagtccg gagcgctggg cggtgcggag cgggcccgcc acccttgccc 1860 

acttctcctt cgctttcctg agccctaaac acacgcgtga ttggtaactg cctggcctgg 1920 

caccgttatc cctgcagacg gcacagagca tccgtctccc ctcgttaaca catggcttcc 1980 

taaatggccc tgtttacggc ctatgagatg aaatatgtga ttttctctaa tgtaacttcc 2040 

tcttaggatg tttcaccaag gaaatattga gagagaagtc ggccaggtag gatgaacaca 2100 

ggcaatgact gcgcagagtg gattaaaggc aaaagagaga agagtccagg aaggggcggg 2160 

gagaagcctg ggtggctcag catcctccac gggctgcgcg tctgctcggg gctgagctgg 2220 

cgggagcagt ttgcgtgttt gggtttttta attgagatga aattcaaata acctaaaaat 2280 

caateacttg aaagtgaaca atcagcggca tttagtacat ccagaaagtt gtgtaggcac 2340 

cacctctgtc acgttctgga acattctgtc atcaccccgt gaagcaatca tttcccctcc 2400 
cgtcttcctc ctcccctggc aactgctgat cgactttgtg tctctgttgt ctaaaatagg 2460 

ttttccctgt tctggacatt tcatataaat ggaatcacac 2500 

<210> 15 
<211> 2068 
<212> DNA 

<213> Homo sapiens 
<400> 15 

cggcagccct cctacctgcg cacgtggtgc cgctgctgct gcctcccgct cgccctgaac 60 
ccagtgcctg cagccatggc tcccggccag ctcgccttat ttagtgtctc tgacaaaacc 120 
ggccttgtgg aatttgcaag aaacctgacc gctcttggtt tgaatctggt cgcttccgga 180 
gggactgcaa aagctctcag ggatgctggt ctggcagtca gagatgtctc tgagttgacg 240 
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agcggtgctg aactgtcagc acatctgtcc ggtccagcat gccttctgag accccccagg 180 
cagaagtggg gcccacaggc tgcccccacc gctcagggcc acactcggcg aaggggagcc 240 
tggagaaggg gtccccagag gataaggaag ccaaggagcc cctgtggatc cggcccgatg 300 
ctccgagcag gtgcacctgg cagctgggcc ggcctgcctc cgagtcccca catcaccaca 360 
ctgccccggc aaaatctcca aaaatcttgc cagatattct gaagaaaatc ggggacaccc 420 
ctatggtcag aatcaacaag attgggaaga agttcggcct gaagtgtgag ctcttggcca 480 
agtgtgagtt cttcaacgcg ggcgggagcg tgaaggaccg catcagcctg cggatgattg 540 
aggatgctga gcgcgacggg acgctgaagc ccggggacac gattatcgag ccgacatccg 600 
ggaacaccgg gatcgggctg gccctggctg cggcagtgag gggctatcgc tgcatcatcg 660 
tgatgccaga gaagatgagc tccgagaagg tggacgtgct gcgggcactg ggggctgaga 720 
ttgtgaggac gcccaccaat gccaggttcg actccccgga gtcacacgtg ggggtggcct 780 
ggcggctgaa gaacgaaatc cccaattctc acatcctaga ccagtaccgc aacgccagca 840 
accccctggc tcactacgac accaccgctg atgagatcct gcagcagtgt gatgggaagc 900. 
tggacatgct ggtggcttca gtgggcacgg gcggcaccat cacgggcatt gccaggaagc 960 
tgaaggagaa gtgtcctgga tgcaggatca ttggggtgga tcccgaaggg tccatcctcg 1020 
cagagccgga ggagctgaac cagacggagc agacaaccta cgaggtggaa gggatcggct 1080 
acgacttcat ccccacggtg ctggacagga cggtggtgga caagtggttc aagagcaacg 1140 
atgaggaggc gttcaccttt gcccgcatgc tgatcgcgca agaggggctg ctgtgcggtg 1200 
gcagtgctgg cagcacggtg gcggtggccg tgaaggctgc gcaggagctg caggagggcc 12 60 
agcgctgcgt ggtcattctg cccgactcag tgcggaacta catgaccaag ttcctgagcg 1320 
acaggtggat gctgcagaag ggctttctga aggaggagga cctcacggag aagaagccct 1380 
ggtggtggca cctccgtgtt caggagctgg gcctgtcagc cccgctgacc gtgctcccga 1440 
ccatcacctg tgggcacacc atcgagatcc tccgggagaa gggcttcgac caggcgcccg 1500 
tggtggatga ggcgggggta atcctgggaa tggtgacgct tgggaacatg ctctcgtccc 1560 
tgcttgccgg gaaggtgcag ccgtcagacc aagttggcaa agtcatctac aagcagttca 1620 
aacagatccg cctcacggac acgctgggca ggctctcgca catcctggag atggaccact 1680 
tcgccctggt ggtgcacgag cagatccagt accacagcac cgggaagtcc agtcagcggc 1740 
agatggtgtt cggggtggtc accgccattg acttgctgaa cttcgtggcc gcccaggagc 1800 
gggaccagaa gtgaagtccg gagcgctggg cggtgcggag cgggcccgcc acccttgccc 1860 
acttctcctt cgctttcctg agccctaaac acacgcgtga ttggtaactg cctggcctgg 1920 
caccgttatc cctgcagacg gcacagagca tccgtctccc ctcgttaaca catggcttcc 1980 
taaatggccc tgtttacggc ctatgagatg aaatatgtga ttttctctaa tgtaacttcc 2040 
tcttaggatg tttcaccaag gaaatattga gagagaagtc ggccaggtag gatgaacaca 2100 
ggcaatgact gcgcagagtg gattaaaggc aaaagagaga agagtccagg aaggggcggg 2160 
gagaagcctg ggtggctcag catcctccac gggctgcgcg tctgctcggg gctgagctgg 2220 
cgggagcagt ttgcgtgttt gggtttttta attgagatga aattcaaata acctaaaaat 2280 
caatcacttg aaagtgaaca atcagcggca tttagtacat ccagaaagtt gtgtaggcac 2340 
cacctctgtc acgttctgga acattctgtc atcaccccgt gaagcaatca tttcccctcc 2400 
cgtcttcctc ctcccctggc aactgctgat cgactttgtg tctctgttgt ctaaaatagg 2460 
ttttccctgt tctggacatt tcatataaat ggaatcacac 2500 

<210> 15 

<211> 2068 

<212> DNA 

<213> Homo sapiens 

<400> 15 

cggcagccct cctacctgcg cacgtggtgc cgctgctgct 
ccagtgcctg cagccatggc tcccggccag ctcgccttat 
ggccttgtgg aatttgcaag aaacctgacc gctcttggtt 
gggactgcaa aagctctcag ggatgctggt ctggcagtca 



gcctcccgct cgccctgaac 60 
ttagtgtctc tgacaaaacc 120 
tgaatctggt cgcttccgga 180 
gagatgtctc tgagttgacg 240 
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ggatttcctg aaatgttggg gggacgtgtg aaaactttgc atcctgcagt ccatgctgga 300 

atcctagctc gtaatattcc agaagataat gctgacatgg ccagacttga tttcaatctt 360 

ataagagttg ttgcctgcaa tctctatccc tttgtaaaga cagtggcttc tccaggtgta 420 

actgttgagg aggctgtgga gcaaattgac attggtggag taaccttact gagagctgca 480 

gccaaaaacc acgctcgagt gacagtggtg tgtgaaccag aggactatgt ggtggtgtcc 540 

acggagatgc agagctccga gagtaaggac acctccttgg agactagacg ccagttagcc 600 

ttgaaggcat tcactcatac ggcacaatat gatgaagcaa tttcagatta tttcaggaaa 660 

cagtacagca aaggcgtatc tcagatgccc ttgagatatg gaatgaaccc acatcagacc 720 

cctgcccagc tgtacacact gcagcccaag cttcccatca cagttctaaa tggagcccct 780 

ggatttataa acttgtgcga tgctttgaac gcctggcagc tggtgaagga actcaaggag 840 

gctttaggta ttccagccgc tgcctctttc aaacatgtca gcccagcagg tgctgctgtt 900 

ggaattccac tcagtgaaga tgaggccaaa gtctgcatgg tttatgatct ctataaaacc 960 

ctcacaccca tctcagcggc atatgcaaga gcaagagggg ctgataggat gtcttcattt 1020 

ggtgattttg ttgcattgtc cgatgtttgt gatgtaccaa ctgcaaaaat tatttccaga 1080 

gaagtatctg atggtataat tgccccagga tatgaagaag aagccttgac aatactttcc 1140 

aaaaagaaaa atggaaacta ttgtgtcctt cagatggacc aatcttacaa accagatgaa 1200 

aatgaagttc gaactctctt tggtcttcat ttaagccaga agagaaataa tggtgtcgtc 1260 

gacaagtcat tatttagcaa tgttgttacc aaaaataaag atttgccaga gtctgccctc 1320 

cgagacctca tcgtagccac cattgctgtc aagtacactc agtctaactc tgtgtgctac 1380 

gccaagaacg ggcaggttat cggcattgga gcaggacagc agtctcgtat acactgcact 1440 

cgccttgcag gagataaggc aaactattgg tggcttagac accatccaca agtgctttcg 1500 

atgaagttta aaacaggagt gaagagagca gaaatctcca atgccatcga tcaatatgtg 1560 

actggaacca ttggcgagga tgaagatttg ataaagtgga aggcactgtt tgaggaagtc 1620 

cctgagttac tcactgaggc agagaagaag gaatgggttg agaaactgac tgaagtttct 1680 

atcagctctg atgccttctt ccctttccga gataacgtag acagagctaa aaggagtggt 1740 

gtggcgtaca ttgcggctcc ctccggttct gctgctgaca aagttgtgat tgaggcctgc 1800 

gacgaactgg gaatcatcct cgctcatacg aaccttcggc tcttccacca ctgattttac 1860 

cacacactgt tttttggctt gcttatgtgt aggtgaacag tcacgcctga aactttgagg 1920 

ataacttttt aaaaaaataa aacagtatct cttaaaacaa tgttttgatc tacataaaca 1980 

ttgtaaaaat tttcaatcac gctttttaac tttcttacca caaaaaaatg ataagtgggt 2040 
gaagtgatgg ttatgttaat tagcgtgc 2068 

<210> 16 
<211> 85.7 
<212> DNA 

<213> Homo sapiens 
<400> 16 

gcgtgggcgt gagatggcgg cggcagcggt gagcagcgcc aagcggagcc 
gctgaagcag cgtctgcggg cgatgagtgc cgaggagcgg ctacgccagt 
gagccagaag gtgattgccc acagtgagta tcaaaagtcc aaaagaattt 
gagcatgcaa gatgaaattg agacagaaga gatcatcaag gacattttcc 
aatctgcttc atccctcggt accggttcca gagcaatcac atggatatgg 
atcaccagag gaaatttctt tacttcccaa aacatcctgg aatatccctc 
gggtgatgtt cgggaggagg ccttgtccac agggggactt gatctcatct 
ccttgggttt gacaaacatg gcaaccgact ggggaggggc aagggctact 
tctgaagcgc tgtttgcagc atcaggaagt gaagccctac accctggcgt 
agaacagatt tgcctccagg tcccagtgaa tgaaaacgac atgaaggtag 
ttacgaagac tcgtcaacag cttaaatctg gattactaca gccaaataat 
tatgagagta aagcaaagta tgtgtatttt tcccttgtca aaaattagtt 
attaatgtga atacagactg cattttaaaa ttgtaattat gaaatacctt 



tgcggggaga 60 
cccgcgtact 120 
ccatctttct 180 
aacgaggcaa 240 
tgagaataga 300 
agcctggtga 360 
tcatgccagg 420 
atgatgccta 480 
tggctttcaa 540 
atgaagtcct 600 
cagtgtttta 660 
gaaattgttc 720 
atataaaacc 780 
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ggatttcctg 
atcctagctc 
ataagagttg 
actgttgagg 
gccaaaaacc 
acggagatgc 
ttgaaggcat 
cagtacagca 
cctgcccagc 
ggatttataa 
gctttaggta 
ggaattccac 
ctcacaccca 
ggtgattttg 
gaagtatctg 
aaaaagaaaa 
aatgaagttc 
gacaagtcat 
cgagacctca 
gccaagaacg 
cgccttgcag 
atgaagttta 
actggaacca 
cctgagttac 
atcagctctg 
gtggcgtaca 
gacgaactgg 
cacacactgt 
ataacttttt 
ttgtaaaaat 
gaagtgatgg 



aaatgttggg 
gtaatattcc 
ttgcctgcaa 
aggctgtgga 
acgctcgagt 
agagctccga 
tcactcatac 
aaggcgtatc 
tgtacacact 
acttgtgcga 
ttccagccgc 
tcagtgaaga 
tctcagcggc 
ttgcattgtc 
atggtataat 
atggaaacta 
gaactctctt 
tatttagcaa 
tcgtagccac 
ggcaggttat 
gagataaggc 
aaacaggagt 
ttggcgagga 
tcactgaggc 
atgccttctt 
ttgcggctcc 
gaatcatcct 
tttttggctt 
aaaaaaataa 
tttcaatcac 
ttatgttaat 



gggacgtgtg 
agaagataat 
tctctatccc 
gcaaattgac 
gacagtggtg 
gagtaaggac 
ggcacaatat 
tcagatgccc 
gcagcccaag 
tgctttgaac 
tgcctctttc 
tgaggccaaa 
atatgcaaga 
cgatgtttgt 
tgccccagga 
ttgtgtcctt 
tggtcttcat 
tgttgttacc 
cattgctgtc 
cggcattgga 
aaactattgg 
gaagagagca 
tgaagatttg 
agagaagaag 
ccctttccga 
ctccggttct 
cgctcatacg 
gcttatgtgt 
aacagtatct 
gctttttaac 
tagcgtgc 



aaaactttgc 
gctgacatgg 
tttgtaaaga 
attggtggag 
tgtgaaccag 
acctccttgg 
gatgaagcaa 
ttgagatatg 
cttcccatca 
gcctggcagc 
aaacatgtca 
gtctgcatgg 
gcaagagggg 
gatgtaccaa 
tatgaagaag 
cagatggacc 
ttaagccaga 
aaaaataaag 
aagtacactc 
gcaggacagc 
tggcttagac 
gaaatctcca 
ataaagtgga 
gaatgggttg 
gataacgtag 
gctgctgaca 
aaccttcggc 
aggtgaacag 
cttaaaacaa 
tttcttacca 



atcctgcagt 
ccagacttga 
cagtggcttc 
taaccttact 
aggactatgt 
agactagacg 
tttcagatta 
gaatgaaccc 
cagttctaaa 
tggtgaagga 
gcccagcagg 
tttatgatct 
ctgataggat 
ctgcaaaaat 
aagccttgac 
aatcttacaa 
agagaaataa 
atttgccaga 
agtctaactc 
agtctcgtat 
accatccaca 
atgccatcga 
aggcactgtt 
agaaactgac 
acagagctaa 
aagttgtgat 
tcttccacca 
tcacgcctga 
tgttttgatc 
caaaaaaatg 



ccatgctgga 
tttcaatctt 
tccaggtgta 
gagagctgca 
ggtggtgtcc 
ccagttagcc 
tttcaggaaa 
acatcagacc 
tggagcccct 
actcaaggag 
tgctgctgtt 
ctataaaacc 
gtcttcattt 
tatttccaga 
aatactttcc 
accagatgaa 
tggtgtcgtc 
gtctgccctc 
tgtgtgctac 
acactgcact 
agtgctttcg 
tcaatatgtg 
tgaggaagtc 
tgaagtttct 
aaggagtggt 
tgaggcctgc 
ctgattttac 
aactttgagg 
tacataaaca 
ataagtgggt 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2068 



<210> 16 
<211> 857 
<212> DNA 

<213> Homo sapiens 



<400> 16 

gcgtgggcgt gagatggcgg cggcagcggt 
gctgaagcag cgtctgcggg cgatgagtgc 
gagccagaag gtgattgccc acagtgagta 
gagcatgcaa gatgaaattg agacagaaga 
aatctgcttc atccctcggt accggttcca 
atcaccagag gaaatttctt tacttcccaa 
gggtgatgtt cgggaggagg ccttgtccac 
ccttgggttt gacaaacatg gcaaccgact 
tctgaagcgc tgtttgcagc atcaggaagt 
agaacagatt tgcctccagg tcccagtgaa 
ttacgaagac tcgtcaacag cttaaatctg 
tatgagagta aagcaaagta tgtgtatttt 
attaatgtga atacagactg cattttaaaa 



gagcagcgcc aagcggagcc tgcggggaga 60 
cgaggagcgg ctacgccagt cccgcgtact 120 
tcaaaagtcc aaaagaattt ccatctttct 180 
gatcatcaag gacattttcc aacgaggcaa 240 
gagcaatcac atggatatgg tgagaataga 300 
aacatcctgg aatatccctc agcctggtga 360 
agggggactt gatctcatct tcatgccagg 420 
ggggaggggc aagggctact atgatgccta 480 
gaagccctac accctggcgt tggctttcaa 540 
tgaaaacgac atgaaggtag atgaagtcct 600 
gattactaca gccaaataat cagtgtttta 660 
tcccttgtca aaaattagtt gaaattgttc 720 
ttgtaattat gaaatacctt atataaaacc 780 
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atctttaaaa accaatagaa gtgtgaatag tagaatatta attaaaatgg aggctatcag 840 
cctgtgattt tcagctt 857 

<210> 17 

<211> 3762 

<212> DNA 

<213> Homo sapiens 

<400> 17 

cccgcgagcg tccatccatc tgtccggccg actgtccagc gaaaggggct ccaggccggg 60 
cgcacgtcga cccgggggac cgaggccagg agaggggcca agagcgcggc tgacccttgc 120 
gggccggggc aggggacggt ggccgcggcc atgcagtcct gtgccagggc gtgggggctg 180 
cgcctgggcc gcggggtcgg gggcggccgc cgcctggctg ggggatcggg gccgtgctgg 240 
gcgccgcgga gccgggacag cagcagtggc ggcggggaca gcgccgcggc tggggcctcg 300 
cgcctcctgg agcgccttct gcccagacac gacgacttcg ctcggaggca catcggccct 360 
ggggacaaag accagagaga gatgctgcag accttggggc tggcgagcat tgatgaattg 420 
atcgagaaga cggtccctgc caacatccgt ttgaaaagac ccttgaaaat ggaagaccct 480 
gtttgtgaaa atgaaatcct tgcaactctg catgccattt caagcaaaaa ccagatctgg 540 
agatcgtata ttggcatggg ctattataac tgctcagtgc cacagacgat tttgcggaac 600 
ttactggaga actcaggatg gatcacccag tatactccat accagcctga ggtgtctcag 660 
gggaggctgg agagtttact caactaccag accatggtgt gtgacatcac aggcctggac 720 
atggccaatg catccctgct ggatgagggg actgcagccg cagaggcact gcagctgtgc 780 
tacagacaca acaagaggag gaaatttctc gttgatcccc gttgccaccc acagacaata 840 
gctgttgtcc agactcgagc caaatatact ggagtcctca- ctgagctgaa gttaccctgt 900 
gaaatggact tcagtggaaa agatgtcagt ggagtgttgt tccagtaccc agacacggag 960 
gggaaggtgg aagactttac ggaactcgtg gagagagctc atcagagtgg gagcctggcc 1020 
tgctgtgcta ctgacctttt agctttgtgc atcttgaggc cacctggaga atttggggta 1080 
gacatcgccc tgggcagctc ccagagattt ggagtgccac tgggctatgg gggaccccat 1140 
gcagcatttt ttgctgtccg agaaagcttg gtgagaatga tgcctggaag aatggtgggg 1200 
gtaacaagag atgccactgg gaaagaagtg tatcgtcttg ctcttcaaac cagggagcaa 1260 
cacattcgga gagacaaggc taccagcaac atctgtacag ctcaggccct cttggcgaat 1320 
atggctgcca tgtttcgaat ctaccatggt tcccatgggc tggagcatat tgctaggagg 1380 
gtacataatg ccactttgat tttgtcagaa ggtctcaagc gagcagggca tcaactccag 1440 
catgacctgt tctttgatac cttgaagatt cattgtggct gctcagtgaa ggaggtcttg 1500 
ggcagggcgg ctcagcggca gatcaatttt cggctttttg aggatggcac acttggtatt 1560 
tctcttgatg aaacagtcaa tgaaaaagat ctggacgatt tgttgtggat ctttggttgt 1620 
gagtcatctg cagaactggt tgctgaaagc atgggagagg agtgcagagg tattccaggg 1680 
tctgtgttca agaggaccag cccgttcctc acccatcaag tgttcaacag ctaccactct 1740 
gaaacaaaca ttgtccggta catgaagaaa ctggaaaata aagacatttc ccttgttcac 1800 
agcatgattc cactgggatc ctgcaccatg aaactgaaca gttcgtctga actcgcacct 1860 
atcacatgga aagaatttgc aaacatccac ccctttgtgc ctctggatca agctcaagga 1920 
tatcagcagc ttttccgaga gcttgagaag gatttgtgtg aactcacagg ttatgaccag 1980 
gtctgtttcc agccaaacag cggagcccag ggagaatatg ctggactggc cactatccga 2040 
gcctacttaa accagaaagg agaggggcac agaacggttt gcctcattcc gaaatcagca 2100 
catgggacca acccagcaag tgcccacatg gcaggcatga agattcagcc tgtggaggtg 2160 
gataaatatg ggaatatcga tgcagttcac ctcaaggcca tggtggataa gcacaaggag 2220 
aacctagcag ctatcatgat tacataccca tccaccaatg gggtgtttga agagaacatc 2280 
agtgacgtgt gtgacctcat ccatcaacat ggaggacagg tctacctaga cggggcaaat 2340 
atgaatgctc aggtgggaat ctgtcgccct ggagacttcg ggtctgatgt ctcgcaccta 2400 
aatcttcaca agaccttctg cattccccac ggaggaggtg gtcctggcat ggggcccatc 2460 
ggagtgaaga aacatctcgc cccgtttttg cccaatcatc ccgtcatttc actaaagcgg 2520 
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atctttaaaa accaatagaa gtgtgaatag tagaatatta attaaaatgg aggctatcag 840 
cctgtgattt tcagctt 857 

<210> 17 

<211> 3762 

<212> DNA 

<213> Homo sapiens 

<400> 17 

cccgcgagcg tccatccatc tgtccggccg actgtccagc gaaaggggct ccaggccggg .60 
cgcacgtcga cccgggggac cgaggccagg agaggggcca agagcgcggc tgacccttgc 120 
gggccggggc aggggacggt ggccgcggcc atgcagtcct gtgccagggc gtgggggctg 180 
cgcctgggcc gcggggtcgg gggcggccgc cgcctggctg ggggatcggg gccgtgctgg 240 
gcgccgcgga gccgggacag cagcagtggc ggcggggaca gcgccgcggc tggggcctcg 300 
cgcctcctgg agcgccttct gcccagacac gacgacttcg ctcggaggca catcggccct 360 
ggggacaaag accagagaga gatgctgcag accttggggc tggcgagcat tgatgaattg 420 
atcgagaaga cggtccctgc caacatccgt ttgaaaagac ccttgaaaat ggaagaccct 480 
gtttgtgaaa atgaaatcct tgcaactctg catgccattt caagcaaaaa ccagatctgg 540 
agatcgtata ttggcatggg ctattataac tgctcagtgc cacagacgat tttgcggaac 600 
ttactggaga actcaggatg gatcacccag tatactccat accagcctga ggtgtctcag 660 
gggaggctgg agagtttact caactaccag accatggtgt gtgacatcac aggcctggac 720. 
atggccaatg catccctgct ggatgagggg actgcagccg cagaggcact gcagctgtgc 7 80 
tacagacaca acaagaggag gaaatttctc gttgatcccc gttgccaccc acagacaata 840 
gctgttgtcc agactcgagc caaatatact ggagtcctca ctgagctgaa gttaccctgt 900 
gaaatggact tcagtggaaa agatgtcagt ggagtgttgt tccagtaccc agacacggag 960 
gggaaggtgg aagactttac ggaactcgtg gagagagctc atcagagtgg gagcctggcc 1020 
tgctgtgcta ctgacctttt agctttgtgc atcttgaggc cacctggaga atttggggta 1080 
gacatcgccc tgggcagctc ccagagattt ggagtgccac tgggctatgg gggaccccat 1140 
gcagcatttt ttgctgtccg agaaagcttg gtgagaatga tgcctggaag aatggtgggg 1200 
gtaacaagag atgccactgg gaaagaagtg tatcgtcttg ctcttcaaac cagggagcaa 1260 
cacattcgga gagacaaggc taccagcaac atctgtacag ctcaggccct cttggcgaat 1320 
atggctgcca tgtttcgaat ctaccatggt tcccatgggc tggagcatat tgctaggagg 1380 
gtacataatg ccactttgat tttgtcagaa ggtctcaagc gagcagggca tcaactccag 1440 
catgacctgt tctttgatac cttgaagatt cattgtggct gctcagtgaa ggaggtcttg 1500 
ggcagggcgg ctcagcggca gatcaatttt cggctttttg aggatggcac acttggtatt 1560 
tctcttgatg aaacagtcaa tgaaaaagat ctggacgatt tgttgtggat ctttggttgt 1620 
gagtcatctg cagaactggt tgctgaaagc atgggagagg agtgcagagg tattccaggg 1680 
tctgtgttca agaggaccag cccgttcctc acccatcaag tgttcaacag ctaccactct 1740 
gaaacaaaca ttgtccggta catgaagaaa ctggaaaata aagacatttc ccttgttcac 1800 
agcatgattc cactgggatc ctgcaccatg aaactgaaca gttcgtctga actcgcacct 1860 
atcacatgga aagaatttgc aaacatccac ccctttgtgc ctctggatca agctcaagga 1920 
tatcagcagc ttttccgaga gcttgagaag gatttgtgtg aactcacagg ttatgaccag 1980 
gtctgtttcc agccaaacag cggagcccag ggagaatatg ctggactggc cactatccga 2040 
gcctacttaa accagaaagg agaggggcac agaacggttt gcc teat tec gaaatcagca 2100 
catgggacca acccagcaag tgcccacatg gcaggcatga agattcagee tgtggaggtg 2160 
gataaatatg ggaatatcga tgcagttcac ctcaaggcca tggtggataa gcacaaggag 2220 
aacctagcag ctatcatgat tacataccca tccaccaatg gggtgtttga agagaacatc 2280 
agtgacgtgt gtgacctcat ccatcaacat ggaggacagg tctacctaga eggggcaaat 2340 
atgaatgetc aggtgggaat ctgtcgccct ggagacttcg ggtctgatgt ctcgcaccta 2400 
aatcttcaca agaccttctg cattccccac ggaggaggtg gtcctggcat ggggcccatc 2460 
ggagtgaaga aacatctcgc cccgtttttg cccaatcatc ccgtcatttc actaaagegg 2520 
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aatgaggatg cctgtcctgt gggaaccgtc agtgcggccc catggggctc cagttccatc 2580 
ttgcccattt cctgggctta tatcaagatg atgggaggca agggtcttaa acaagccacg 2640 
gaaactgcga tattaaatgc caactacatg gccaagcgat tagaaacaca ctacagaatt 2700 
cttttcaggg gtgcaagagg ttatgtgggt catgaattta ttttggacac gagacccttc 2760 
aaaaagtctg caaatattga ggctgtggat gtggccaaga gactccagga ttatggattt 2820 
cacgccccta ccatgtcctg gcctgtggca gggaccctca tggtggagcc cactgagtcg 2880 
gaggacaagg cagagctgga cagattctgt gatgccatga tcagcattcg gcaggaaatt 2940 
gctgacattg aggagggccg catcgacccc agggtcaatc cgctgaagat gtctccacac 3000 
tccctgacct gcgttacatc ttcccactgg gaccggcctt attccagaga ggtggcagca 3060 
ttcccactcc ccttcatgaa accagagaac aaattctggc caacgattgc ccggattgat 3120 
gacatatatg gagatcagca cctggtttgt acctgcccac ccatggaagt ttatgagtct 3180 
ccattttctg aacaaaagag ggcgtcttct tagtcctctc tccctaagtt taaaggactg 3240 
atttgatgcc tctccccaga gcatttgata agcaagaaag atttcatctc ccaccccagc 3300 
ctcaagtagg agttttatat actgtgtata tctctgtaat ctctgtcaag gtaaatgtaa 3360 
atacagtagc tggagggagt cgaagctgat ggttggaaga cggatttgct ttggtattct 3420 
gcttccacat gtgccagttg cctggattgg gagccatttt gtgttttgcg tagaaagttt 3480 
taggaacttt aacttttaat gtggcaagtt tgcagatgtc atagaggcta tcctggagac 3 540 
ttaatagaca tttttttgtt ccaaaagagt ccatgtggac tgtgccatct gtgggaaatc 3600- 
ccagggcaaa tgtttacatt ttgtataccc tgaagaactc tttttcctct aatatgccta 3660 
atctgtaatc acatttctga gtgttttcct ctttttctgt gtgaggtttt tttttttttt 3720 
aatctgcatt tattagtatt ctaataaaag cattttgatc gg 3762 

<210> 18 
<211> 1192 
<212> DNA 

<213> Homo sapiens 
<400> 18 

ggctccctcc ggccgcgaac tgcccctccc cgccccgcct cccggcgcgg gtggccgagg 60 

cgtagcgccg cgacccccgc acccctgcga acatggcgct gcgagtggtg cggagcgtgc 120 

gggccctgct ctgcaccctg cgcgcggtcc cgttacccgc cgcgccctgc ccgccgaggc 180 

cctggcagct gggggtgggc gccgtccgta cgctgcgcac tggacccgct ctgctctcgg 240 

tgcgtaaatt cacagagaaa cacgaatggg taacaacaga aaatggcatt ggaacagtgg 300 

gaatcagcaa ttttgcacag gaagcgttgg gagatgttgt ttattgtagt ctccctgaag 360 

ttgggacaaa attgaacaaa caagatgagt ttggtgcttt ggaaagtgtg aaagctgcta 420 

gtgaactata ttctccttta tcaggagaag taactgaaat taatgaagct cttgcagaaa 480 

atccaggact tgtaaacaaa tcttgttatg aagatggttg gctgatcaag atgacactga 540 

gtaacccttc agaactagat gaacttatga gtgaagaagc atatgagaaa tacataaaat 600 

ctattgagga gtgaaaatgg aactcctaaa taaactagta tgaaataacg aagccagcag 660 

agttgtctta aattagtggt ggatagagac ttagaataga aacttttagt attaccgatg 720 

gggcaaaaaa aaactactgt taacactgct aatgaaagaa aatgcccttt aactttgtaa 780 

tgattataga taaatataat atgcgtcttt ttcacaatat cctatgattt ttagactagg 840 

ctctagtgtt cagaattcat gaaattatcc atggtaaaaa ctagttataa aaattacata 900 

attcaaagat aacattgtta ttcttaagcc ttatataata ttgtaacttg catgtatcca 960 

tacctggatt tgggatgaaa tacttaatga tctttccatt ggaaataact ggaagtgaag 1020 

aggttttgtt gcttgtacag tgtcagatga ggaacaccac tatcttaatt ttgcgataca 1080 

ctgcatttgc tggtgctatt tttatacagt gaagcaacag ctttgcagca aaataataaa 1140 

atacttcttc gttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1192 

<210> 19 
<211> 2102 
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aatgaggatg cctgtcctgt gggaaccgtc agtgcggccc catggggctc cagttccatc 2580 
ttgcccattt cctgggctta tatcaagatg atgggaggca agggtcttaa acaagccacg 2640 
gaaactgcga tattaaatgc caactacatg gccaagcgat tagaaacaca ctacagaatt 2700 
cttttcaggg gtgcaagagg ttatgtgggt catgaattta ttttggacac gagacccttc 2760 
aaaaagtctg caaatattga ggctgtggat gtggccaaga gactccagga ttatggattt 2820 
cacgccccta ccatgtcctg gcctgtggca gggaccctca tggtggagcc cactgagtcg 2880 
gaggacaagg cagagctgga cagattctgt gatgccatga tcagcattcg gcaggaaatt 2940 
gctgacattg aggagggccg catcgacccc agggtcaatc cgctgaagat gtctccacac 3000 
tccctgacct gcgttacatc ttcccactgg gaccggcctt attccagaga ggtggcagca 3060 
ttcccactcc ccttcatgaa accagagaac aaattctggc caacgattgc ccggattgat 3120 
gacatatatg gagatcagca cctggtttgt acctgcccac ccatggaagt ttatgagtct 3180 
ccattttctg aacaaaagag ggcgtcttct tagtcctctc tccctaagtt taaaggactg 3240 
atttgatgcc tctccccaga gcatttgata agcaagaaag atttcatctc ccaccccagc 3300 
ctcaagtagg agttttatat actgtgtata tctctgtaat ctctgtcaag gtaaatgtaa 3360 
atacagtagc tggagggagt cgaagctgat ggttggaaga cggatttgct ttggtattct 3420 
gcttccacat gtgccagttg cctggattgg gagccatttt gtgttttgcg tagaaagttt 3480 
taggaacttt aacttttaat gtggcaagtt tgcagatgtc atagaggcta tcctggagac 3 540 
ttaatagaca tttttttgtt ccaaaagagt ccatgtggac tgtgccatct gtgggaaatc 3600 
ccagggcaaa tgtttacatt ttgtataccc tgaagaactc tttttcctct aatatgccta 3660 
atctgtaatc acatttctga gtgttttcct ctttttctgt gtgaggtttt tttttttttt 3720 
aatctgcatt tattagtatt ctaataaaag cattttgatc gg .3762 

<210> 18 
<211> 1192 
<212> DNA 

<213> Homo sapiens 
<400> 18 

ggctccctcc ggccgcgaac tgcccctccc cgccccgcct cccggcgcgg gtggccgagg 60 

cgtagcgccg cgacccccgc acccctgcga acatggcgct gcgagtggtg cggagcgtgc 120 

gggccctgct ctgcaccctg cgcgcggtcc cgttacccgc cgcgccctgc ccgccgaggc 180 

cctggcagct gggggtgggc gccgtccgta cgctgcgcac tggacccgct ctgctctcgg 240 

tgcgtaaatt cacagagaaa cacgaatggg taacaacaga aaatggcatt ggaacagtgg 300 

gaatcagcaa ttttgcacag gaagcgttgg gagatgttgt ttattgtagt ctccctgaag 360 

ttgggacaaa attgaacaaa caagatgagt ttggtgcttt ggaaagtgtg aaagctgcta 420 

gtgaactata ttctccttta tcaggagaag taactgaaat taatgaagct cttgcagaaa 480 

atccaggact tgtaaacaaa tcttgttatg aagatggttg gctgatcaag atgacactga 540 

gtaacccttc agaactagat gaacttatga gtgaagaagc atatgagaaa tacataaaat 600 

ctattgagga gtgaaaatgg aactcctaaa taaactagta tgaaataacg aagccagcag 660 

agttgtctta aattagtggt ggatagagac ttagaataga aacttttagt attaccgatg 720 

gggcaaaaaa aaactactgt taacactgct aatgaaagaa aatgcccttt aactttgtaa 780 

tgattataga taaatataat atgcgtcttt ttcacaatat cctatgattt ttagactagg 840 

ctctagtgtt cagaattcat gaaattatcc atggtaaaaa ctagttataa aaattacata 900 

attcaaagat aacattgtta ttcttaagcc ttatataata ttgtaacttg catgtatcca 960 

tacctggatt tgggatgaaa tacttaatga tctttccatt ggaaataact ggaagtgaag 1020 

aggttttgtt gcttgtacag tgtcagatga ggaacaccac tatcttaatt ttgcgataca 1080 

ctgcatttgc tggtgctatt tttatacagt gaagcaacag ctttgcagca aaataataaa 1140 

atacttcttc gttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1192 

<210> 19 
<211> 2102 
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<212> DNA 

<213> Homo sapiens 

<400> 19 

tgcccacgcc cccttcagat cctttgctcc ggagagagac ctgtccgagc agaggcctgg 60 
actacatctc ccggcgtgcc tggcagtgtg gtggcctctg tgcgccgtct gcactcgttg 120 
caggcgacga tgcagagggc tgtaagtgtg gtggcccgtc tgggctttcg cctgcaggca 180 
ttccccccgg ccttgtgtcg tccacttagt tgcgcacagg aggtgctccg caggacaccg 240 
ctctatgact tccacctggc ccacggcggg aaaatggtgg cgtttgcggg ttggagtctg 300 
ccagtgcagt accgggacag tcacactgac tcgcacctgc acacacgcca gcactgctcg 360 
ctctttgacg tgtctcatat gctgcagacc aagatacttg gtagtgaccg ggtgaagctg 420 
atggagagtc tagtggttgg agacattgca gagctaagac caaaccaggg gacactgtcg 480 
ctgtttacca acgaggctgg aggcatctta gatgacttga ttgtaaccaa tacttctgag 540 
ggccacctgt atgtggtgtc caacgctggc tgctgggaga aagatttggc cctcatgcag 600 
gacaaggtca gggagcttca gaaccagggc agagatgtgg gcctggaggt gttggataat 660 
gccctgctag ctctgcaagg ccccactgca gcccaggtac tacaggccgg cgtggcagat 720 
gacctgagga aactgccctt catgaccagt gctgtgatgg aggtgtttgg cgtgtctggc 780 
tgccgcgtga cccgctgtgg ctacacagga gaggatggtg tggagatctc ggtgccggta 840 
gcgggggcag ttcacctggc aacagctatt ctgaaaaacc cagaggtgaa gctggcaggg 900 
ctggcagcca gggacagcct gcgcctggag gcaggcctct gcctgtatgg gaatgacatt ,960 
gatgaacaca ctacacctgt ggagggcagc ctcagttgga cactggggaa gcgccgccga 1020 
gctgctatgg acttccctgg agccaaggtc attgttcccc agctgaaggg cagggtgcag 1080 
cggaggcgtg tggggttgat gtgtgagggg gcccccatgc gggcacacag tcccatcctg 1140 
aacatggagg gtaccaagat tggtactgtg actagtggct gcccctcccc ctctctgaag 1200 
aagaatgtgg cgatgggtta tgtgccctgc gagtacagtc gtccagggac aatgctgctg 1260 
gtagaggtgc ggcggaagca gcagatggct gtagtcagca agatgccctt tgtgcccaca 1320 
aactactata ccctcaagtg aagctggctc agggtggggc tgtcccttcc aggagttttg 1380 
cccctacaag gggttagtca agaagctgag gcagaactca ctgggggtgg gcagttaagg 1440 
tggaggctga ttctaattgt ctggttgagg ggccacacca cctattcccc ccacctaact 1500 
catgccattc cagcttcctt caggaccctg cttctgagtg acggaccagc tcacacaatg 1560 
tcttgtttca gtccatgatc ccactgacct actcttgcct gctggagggt aatgagaagc 1620 
tttggttctg ccatctctcc cactctgcca ggtgctggct gtggagcaaa ggctcacctt 1680 
tgtggagagg ataaaacctg cccaacctac ctcaccatgg tttttcacat tgcaaagggt 1740 
aataacatgg gcagtgcgga cttaggctac cccctccagt ttgctttccg taaatgcaaa 1800 
ttgtccttac tgcaagtcag gaatgattgc tgactcacag tagggctgct atgcctgtgt 1860 
gtaaacttgg ggatggctga gggaacatag actcactctt ccacattccc aagttggtct 1920 
agtgtgctgc ccagtagcaa accatggcag actcaccacc tattctgagt tccagggctg 1980 
ctgtagggca gggtgggctt cctcccagac ttgccttacc ctgggctgat ctttgcccct 2040 
ggtatgcatt aatggactcc actgaatcct gaaaaaaaaa ttaaacttcc ttcttacttg 2100 
cc 2102 

<210> 20 

<211> 3228 

<212> DNA 

<213> Homo sapiens 

<400> 20 

aaaaaactca ggcaaagtca cagcctcaaa attgttcact gaaagaacgc tgagtggaga 60 
agtgtgagaa gatgaatgga ccggtggatg gcttgtgtga ccactctcta agtgaaggag 120 
tctteatgtt cacatcggag tctgtgggag agggacaccc ggataagatc tgtgaccaga 180 
tcagtgatgc agtgctggat gcccatctca agcaagaccc caatgccaag gtggcctgtg 240 
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<212> DNA 

<213> Homo sapiens 



<400> 19 

tgcccacgec cccttcagat cctttgctcc 
actacatctc ccggcgtgcc tggcagtgtg 
caggcgacga tgcagagggc tgtaagtgtg 
ttccccccgg ccttgtgtcg tccacttagt 
ctctatgact tccacctggc ccacggcggg 
ccagtgcagt accgggacag tcacactgac 
ctctttgacg tgtctcatat gctgcagacc 
atggagagtc tagtggttgg agacattgca 
ctgtttacca acgaggctgg aggcatctta 
ggccacctgt atgtggtgtc caacgctggc 
gacaaggtca gggagcttca gaaccagggc 
gccctgctag ctctgcaagg ccccactgca 
gacctgagga aactgccctt catgaccagt 
tgccgcgtga cccgctgtgg ctacacagga 
gcgggggcag ttcacctggc aacagctatt 
ctggcagcca gggacagcct gcgcctggag 
gatgaacaca ctacacctgt ggagggcagc 
gctgctatgg acttccctgg agccaaggtc 
cggaggcgtg tggggttgat gtgtgagggg 
aacatggagg gtaccaagat tggtactgtg 
aagaatgtgg cgatgggtta tgtgccctgc 
gtagaggtgc ggcggaagca gcagatggct 
aactactata ccctcaagtg aagctggctc 
cccctacaag gggttagtca agaagctgag 
tggaggctga ttctaattgt ctggttgagg 
catgccattc cagcttcctt caggaccctg 
tcttgtttca gtccatgatc ccactgacct 
tttggttctg ccatctctcc cactctgcca 
tgtggagagg ataaaacctg cccaacctac 
aataacatgg gcagtgcgga . cttaggctac 
ttgtccttac tgcaagtcag gaatgattgc 
gtaaacttgg ggatggctga gggaacatag 
agtgtgctgc ccagtagcaa accatggcag 
ctgtagggca gggtgggctt cctcccagac 
ggtatgcatt aatggactcc actgaatcct 
cc 



ggagagagac ctgtccgagc agaggcctgg 60 
gtggcctctg tgcgccgtct gcactcgttg 120 
gtggcccgtc tgggctttcg cctgcaggca 180 
tgcgcacagg aggtgctccg caggacaccg 240 
aaaatggtgg cgtttgcggg ttggagtctg 300 
tcgcacctgc acacacgcca gcactgctcg 360 
aagatacttg gtagtgaccg ggtgaagctg 420. 
gagctaagac caaaccaggg gacactgtcg 480 
gatgacttga ttgtaaccaa tacttctgag 540 
tgctgggaga aagatttggc cctcatgcag 600 
agagatgtgg gcctggaggt gttggataat 6.60 
gcccaggtac tacaggccgg cgtggcagat 720 
gctgtgatgg aggtgtttgg cgtgtctggc 7,80 
gaggatggtg tggagatctc ggtgccggta 840 
ctgaaaaacc cagaggtgaa gctggcaggg 900 
gcaggcctct gcctgtatgg gaatgacatt 960 
ctcagttgga cactggggaa gcgccgccga 1020.* 
attgttcccc agctgaaggg cagggtgcag 1080 : 
gcccccatgc gggcacacag tcccatcctg 1140 
actagtggct gcccctcccc ctctctgaag 1200 * 
gagtacagtc gtccagggac aatgctgctg 1260 
gtagtcagca agatgccctt tgtgcccaca 1320 
agggtggggc tgtcccttcc aggagttttg 1380 
gcagaactca ctgggggtgg gcagttaagg 1440 
ggccacacca cctattcccc ccacctaact 1500 
cttctgagtg acggaccagc tcacacaatg 1560 
actcttgcct gctggagggt aatgagaagc 1620 
ggtgctggct gtggagcaaa ggctcacctt 1680 
ctcaccatgg tttttcacat' tgcaaagggt 1740 
cccctccagt ttgctttccg taaatgcaaa 1800 
tgactcacag tagggctgct atgcctgtgt 1860 
actcactctt ccacattccc aagttggtct 1920 
actcaccacc tattctgagt tccagggctg 1980 
ttgccttacc ctgggctgat ctttgcccct 2040 
gaaaaaaaaa ttaaacttcc ttcttacttg 2100 

2102 



<210> 20 
<211> 3228 
<212> DNA 

<213> Homo sapiens 



<400> 20 

aaaaaactca ggcaaagtca cagcctcaaa 
agtgtgagaa gatgaatgga ccggtggatg 
tcttcatgtt cacatcggag tctgtgggag 
tcagtgatgc agtgctggat gcccatctca 



attgttcact gaaagaacgc tgagtggaga 60 
gcttgtgtga ccactctcta agtgaaggag 120 
agggacaccc ggataagatc tgtgaccaga 180 
agcaagaccc caatgccaag gtggcctgtg 240 
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agacagtgtg caagaccggc atggtgctgc 
tggactacca gcgggtggtg agggacacca 
agggctttga cttcaagact tgcaacgtgc 
ttgcccagtg cgtccatctg gacagaaatg 
tgatgttcgg ctatgctacc gacgagacag 
ctcacaagct caacgcccgg atggcagacc 
ggcctgactc taagactcag gtgacagttc 
ctgtgcgcat ccacaccatc gtcatctctg 
agatgcgcag ggccctgaag gagcaagtca 
acgaagacac cgtctaccac ctgcagccca 
gggatgcggg tgtcactggc cgtaagatta 
atggtggtgg ggccttctct gggaaggact 
ctgcccgctg ggtggccaag tctctggtga 
aggtttccta tgccattggt gtggccgagc 
cctctcagaa gacagagcga gagctgctgg 
cgggcgtcat tgtcagggat ttggacttga 
acggccattt cggaagaagc gagttcccat 
gccaggggga gctgggcctg gtctcaccct 
ccagacgcct ggctgctgat cgccttcccc 
cctctcattt agcctgtcct gtcatcatca 
ctggaggttg gatcttgatg taaggatggg 
ggggcaatgt taatttagtg gaaaaggcac 
tcccccaaca gctggaccct gaccagctcc 
gtcagcacat ctcaacaggc ctcagggctc 
tcacaggcag ccagtgccct gagccagggt 
tggcaggggt tagagcagga ctgatgtctc 
ctaataactg atctcgtttt ttcttcactg 
catttataga tgggggaaag caaagcacag 
agccacgtgt ttagacctag atgtataact 
cccccttccc caccgttgtc tcatgccagt 
agaaatgagg ccatggactc tgtcctgaag 
cccactagct cttgttgagc cgtgggcaat 
ctattttgtt tgcttcatgt gtcccagaag 
catcagggta ggctggagtt agggactgac 
actcagagac atgcagagat tgggtttttg 
ccagtcctat gtgggcttcc tggagcagaa 
ccccttagac agaggccctc aagtctttct 
ctgagtgcat ttggagggag tgagtggccc 
ctcatgttaa ataggcgggg cttgccttct 
ggagattctg cctttgccag gtgactgtct 
gcagtccttg gcctgctaag gtcttggaac 
acctgcccta cctgccccac ttgtccttag 
tctccccagc tcccaggacc acaggctcag 
tggggacttg gcctctctct agggtacatg 
tctggggagc aggcagccag catctgggcc 
attttctctc aggtgaaagg aaattaaggc 
ggaagcctca gattccttcc catggaggga 
taacttggct cacactcgac atgaaaattc 
aaataagatc ttttttgtca gtttgtttgt 
cctgctaaaa caatgaataa aaactcaagg 



tgtgtggtga gatcacctca atggccatgg 300 
tcaagcacat cggctacgat gactcagcca 360 
tggtggcttt ggagcagcaa tccccagata 420 
aggaggatgt gggggcagga gatcagggtt 480 
aggagtgcat gcccctcacc atcatccttg 540 
tcaggcgctc cggcctcctc ccctggctgc 600 
agtacatgca ggacaatggc gcagtcatcc 660 
tgcagcacaa cgaagacatc acgctggagg 720 
tcagggccgt ggtgccggcc aagtacctgg 780 
gtgggcggtt tgtcatcgga ggtccccagg 840 
ttgtggacac ctatggcggc tggggggctc 900 
acaccaaggt agaccgctca gctgcatatg 960 
aagcagggct ctgccggaga gtgcttgtcc 1020 
cgctgtccat ttccatcttc acctacggaa 1080 
atgtggtgca taagaacttc gacctccggc 1140 
agaagcccat ctaccagaag acagcatgct 1200 
gggaggttcc caggaagctt gtattttaga 1260 
ggaggcacct ggtggccatg ctcctcttcc 1320 
acccaccaac cctcagggca aagccaggtc 1380 
tggccagctg gaggcagggg cttcctggtg 1440 
catggtgttc tcctgctgct ccctcagact 1500 
ccccgtcaag agtgaattcc ctcactcgtc 1560 
ccctccctcc ccttgcctgt gccaggtgag 1620 
cttgtgggcc tgggctcctg gacccccctt 1680 
ctccagaaag ccccacccag gccaggcatg 1740 
ctaagcacct gtaatgtgcg agggacccag 1800 
caacatgatg aggtagtacc ttttatatcc 1860 
agagtctgga taacttccac agggtcccac 1920 
aggagctttg- actcaggagc ctgtgacata 1980 
aacaggctca aacaatgaca aagcagattc 2040 
gcctgaggtt actggaaatt aggggattaa 2100 
tgtctgaaaa gtgaagacag aaccacaggg 2160 
atgactgagg gtgagttggc ttacctggcc 2220 
cagcagcttt agaatcccag ccccctgacc 2280 
gacttctggg gtaagtggtc taagtccagt 2340 
gcagcaactt gtcctagcac agatggccag 2400 
ctttccctgg tcccttgtat cccctgcagg 2460 
tttcggatcc agggaggctg gtcctatggc 2520 
ggtgttggac aagcttctga gacgtcatga 2580 
ggggagcggg tctgctccca aggggcctga 2640 
ttgcctgcct ttccatccat ggccagcagc 2700 
cctggacctc tgacagcagc atctctacct 2760 
gcagggcctc catgggcccc aggggaacac 2820 
gtgctgggag aggcagccca ggaagtctca 2880 
ttggcctgga gcacaaagac cctggctttc 2940 
aacaaaagaa gcccggctcc tggtcaccta 3000 
gggagtggtt tgcaggtggc caagttcctc 3060 
agaattttat actttcccta ccctctagag 3120 
atgaaactaa agctttattt gttaatagtt 3180 
agcaactaaa aaaaaaaa 3228 
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agacagtgtg caagaccggc atggtgctgc 
tggactacca gcgggtggtg agggacacca 
agggctttga cttcaagact tgcaacgtgc 
ttgcccagtg cgtccatctg gacagaaatg 
tgatgttcgg ctatgctacc gacgagacag 
ctcacaagct caacgcccgg atggcagacc 
ggcctgactc taagactcag gtgacagttc 
ctgtgcgcat ccacaccatc gtcatctctg 
agatgcgcag ggccctgaag gagcaagtca 
acgaagacac cgtctaccac ctgcagccca 
gggatgcggg tgtcactggc cgtaagatta 
atggtggtgg ggccttctct gggaaggact 
ctgcccgctg ggtggccaag tctctggtga 
aggtttccta tgccattggt gtggccgagc 
cctctcagaa gacagagcga gagctgctgg 
cgggcgtcat tgtcagggat ttggacttga 
acggccattt cggaagaagc gagttcccat 
gccaggggga gctgggcctg gtctcaccct 
ccagacgcct ggctgctgat cgccttcccc 
cctctcattt agcctgtcct gtcatcatca 
ctggaggttg gatcttgatg taaggatggg 
ggggcaatgt taatttagtg gaaaaggcac 
tcccccaaca gctggaccct gaccagctcc 
gtcagcacat ctcaacaggc ctcagggctc 
tcacaggcag ccagtgccct gagccagggt 
tggcaggggt tagagcagga ctgatgtctc 
ctaataactg atctcgtttt ttcttcactg 
catttataga tgggggaaag caaagcacag 
agccacgtgt ttagacctag atgtataact 
cccccttccc caccgttgtc tcatgccagt 
agaaatgagg ccatggactc tgtcctgaag 
cccactagct cttgttgagc cgtgggcaat 
ctattttgtt tgcttcatgt gtcccagaag 
catcagggta ggctggagtt agggactgac 
actcagagac atgcagagat tgggtttttg 
ccagtcctat gtgggcttcc tggagcagaa 
ccccttagac agaggccctc aagtctttct 
ctgagtgcat ttggagggag tgagtggccc 
ctcatgttaa ataggcgggg cttgccttct 
ggagattctg cctttgccag gtgactgtct 
gcagtccttg gcctgctaag gtcttggaac 
acctgcccta cctgccccac ttgtccttag 
tctccccagc tcccaggacc acaggctcag 
tggggacttg gcctctctct agggtacatg 
tctggggagc aggcagccag catctgggcc 
attttctctc aggtgaaagg aaattaaggc 
ggaagcctca gattccttcc ca.tggaggga 
taacttggct cacactcgac atgaaaattc 
aaataagatc ttttttgtca gtttgtttgt 
cctgctaaaa caatgaataa aaactcaagg 



tgtgtggtga gatcacctca atggccatgg 300 
tcaagcacat cggctacgat gactcagcca 360 
tggtggcttt ggagcagcaa tccccagata 420 
aggaggatgt gggggcagga gatcagggtt 480 
aggagtgcat gcccctcacc atcatccttg 540 
tcaggcgctc cggcctcctc ccctggctgc 600 
agtacatgca ggacaatggc gcagtcatcc 660 
tgcagcacaa cgaagacatc acgctggagg 720 
tcagggccgt ggtgccggcc aagtacctgg 780 
gtgggcggtt tgtcatcgga ggtccccagg 840 
ttgtggacac ctatggcggc tggggggctc 900 
acaccaaggt agaccgctca gctgcatatg 960 
aagcagggct ctgccggaga gtgcttgtcc 1020 
cgctgtccat ttccatcttc acctacggaa 1080 
atgtggtgca taagaacttc gacctccggc 1140 
agaagcccat ctaccagaag acagcatgct 1200 
gggaggttcc caggaagctt gtattttaga 1260 
ggaggcacct ggtggccatg ctcctcttcc 1320 
acccaccaac cctcagggca aagccaggtc 1380 
tggccagctg gaggcagggg cttcctggtg 1440 
catggtgttc tcctgctgct ccctcagact 1500 
ccccgtcaag agtgaattcc ctcactcgtc 1560 
ccctccctcc ccttgcctgt gccaggtgag 1620 
cttgtgggcc tgggctcctg gacccccctt 1680 
ctccagaaag ccccacccag gccaggcatg 1740 
ctaagcacct gtaatgtgcg agggacccag 1800 
caacatgatg aggtagtacc ttttatatcc 1860 
agagtctgga taacttccac agggtcccac 1920 
aggagctttg actcaggagc ctgtgacaca 1980 
aacaggctca aacaatgaca aagcagattc 2040 
gcctgaggtt actggaaatt aggggattaa 2100 
tgtctgaaaa gtgaagacag aaccacaggg 2160 
atgactgagg gtgagttggc ttacctggcc 2220 
cagcagcttt agaatcccag ccccctgacc 2280 
gacttctggg gtaagtggtc taagtccagt 2340 
gcagcaactt gtcctagcac agatggccag 2400 
ctttccctgg tcccttgtat cccctgcagg 2460 
tttcggatcc agggaggctg gtcctatggc 2520 
ggtgttggac aagcttctga gacgtcatga 2580 
ggggagcggg tctgctccca aggggcctga 2640 
ttgcctgcct ttccatccat ggccagcagc 2700 
cctggacctc tgacagcagc atctctacct 2760 
gcagggcctc catgggcccc aggggaacac 2820 
gtgctgggag aggcagccca ggaagtctca 2880 
ttggcctgga gcacaaagac cctggctttc 2940 
aacaaaagaa gcccggctcc tggtcaccta 3000 
gggagtggtt tgcaggtggc caagttcctc 3060 
agaattttat actttcccta ccctctagag 3120 
atgaaactaa agctttattt gttaatagtt 3180 
agcaactaaa aaaaaaaa 3228 



26 



WO 00/71754 



PCT/US00/14354 



<210> 21 

<211> 344 

<212> PRT 

<213> Homo sapiens 

<400> 21 

Met Ser Ala Leu Ala Ala Arg Leu Leu Gin Pro Ala His Ser Cys Ser 
1 5 10 15 

Leu Arg Leu Arg Pro Phe His Leu Ala Ala Val Arg Asn Glu Ala Val 
20 25 30 

Val He Ser Gly Arg Lys Leu Ala Gin Gin He Lys Gin Glu Val Arg 
3 5 40, 45 

Gin Glu Val Glu Glu Trp Val Ala Ser Gly Asn Lys Arg Pro His Leu 
50 55 60 

Ser Val He Leu Val Gly Glu Asn Pro Ala Ser, His Ser Tyr Val Leu 
65 70 75 80 

Asn Lys Thr Arg Ala Ala, Ala Val Val Gly He Asn Ser Glu Thr He 
85 90 95 

Met Lys Pro Ala Ser He Ser Glu Glu Glu Leu Leu Asn Leu He Asn 
100 105 HO 

Lys Leu Asn Asn Asp Asp Asn Val Asp Gly Leu Leu Val Gin Leu Pro 
115 120 125 

Leu Pro Glu His He Asp Glu Arg Arg He Cys Asn Ala Val Ser Pro 
130 135 140 

Asp Lys Asp Val Asp Gly Phe His Val He Asn Val Gly Arg Met Cys 
145 150 155 160 

Leu Asp Gin Tyr Ser Met Leu Pro Ala Thr Pro Trp Gly Val Trp Glu 
165 170 ' 175 

He He Lys Arg Thr Gly He Pro Thr Leu Gly Lys Asn Val Val Val 
180 185 190 

Ala Gly Arg Ser Lys Asn Val Gly Met Pro lie Ala Met Leu Leu His 
195 200 205 

Thr Asp Gly Ala His Glu Arg Pro Gly Gly Asp Ala Thr Val Thr He 
210 215 220 

Ser His Arg Tyr Thr Pro Lys Glu Gin Leu Lys Lys His Thr He Leu 
225 230 235 240 



27 



WO 00/71754 



PCT/US00/14354 



<210> 21 
<211> 344 
<212> PRT 

<213> Homo sapiens 



<400> 21 

Met Ser Ala Leu Ala Ala Arg Leu Leu Gin Pro Ala His Ser Cys Ser 
1 5 10 15 

Leu Arg Leu Arg Pro Phe His Leu Ala Ala Val Arg Asn Glu Ala Val 
20 25 30 

Val He Ser Gly Arg Lys Leu Ala Gin Gin He Lys Gin Glu Val Arg 
35 40 45 

Gin Glu Val Glu Glu Trp Val Ala Ser Gly Asn Lys Arg Pro His Leu 
50 55 60 

Ser Val He Leu Val Gly Glu Asn Pro Ala Ser His Ser Tyr Val Leu 
65 70 75 80 

Asn Lys Thr Arg Ala Ala Ala Val Val Gly He Asn Ser Glu Thr He 
85 90 95 

Met Lys Pro Ala Ser He Ser Glu Glu Glu Leu Leu Asn Leu He Asn 
100 105 HO 

Lys Leu Asn Asn Asp Asp Asn Val Asp Gly Leu Leu Val Gin Leu Pro 
115 120 125 

Leu Pro Glu His He Asp Glu Arg Arg lie Cys Asn Ala Val Ser Pro 
130 135 140 

Asp Lys Asp Val Asp Gly Phe His Val He Asn Val Gly Arg Met Cys 
145 150 155 160 

Leu Asp Gin Tyr Ser Met Leu Pro Ala Thr Pro Trp Gly Val Trp Glu 
165 170 175 

He He Lys Arg Thr Gly He Pro Thr Leu Gly Lys Asn Val Val Val 
180 185 190 

Ala Gly Arg Ser Lys Asn Val Gly Met Pro He Ala Met Leu Leu His 
195 200 205 

Thr Asp Gly Ala His Glu Arg Pro Gly Gly Asp Ala Thr Val Thr He 
210 215 220 

Ser His Arg Tyr Thr Pro Lys Glu Gin Leu Lys Lys His Thr He Leu 
225 230 235 240 
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Ala Asp He Val He Ser Ala Ala Gly He Pro Asn Leu He Thr Ala 
245 250 255 

Asp Met He Lys Glu Gly Ala Ala Val He Asp Val Gly He Asn Arg 
260 265 270 

Val His Asp Pro Val Thr Ala Lys Pro Lys Leu Val Gly Asp Val Asp 
275 280 285 

Phe Glu Gly Val Arg Gin Lys Ala Gly Tyr He Thr Pro Val Pro Gly 
290 295 300 

Gly Val Gly Pro Met Thr Val Ala Met Leu Met Lys Asn Thr lie He 
305 310 315 320 

Ala Ala Lys Lys Val Leu Arg Leu Glu Glu Arg Glu Val Leu Lys Ser 
325 330 335 



Lys Glu Leu Gly Val Ala Thr Asn 
340 



<210> 22 

<211> 1283 ■ 

<212> DNA 

<213> Homo sapiens 

<400> 22 

tttcgcagcc gctgccgcct cgccgctgct 
ccaacatgaa cggacagctc aacggcttcc 
ttttcacctc agagtcggtc ggggaaggcc 
atgctgtcct tgatgcccac cttcagcagg 
ttgctaaaac tggaatgatc cttcttgctg 
accagaaagt ggttcgtgaa gctgttaaac 
ttgactacaa gacttgtaac gtgctggtag 
aaggtgttca tcttgacaga aatgaagaag 
ttggctatgc cactgatgaa actgaggagt 
agctaaatgc caaactggca gaactacgcc 
attctaaaac tcaagttact gtgcagtata 
gagtccacac aattgttata tctgttcagc 
gggatgccct aaaggagaaa gtcatcaaag 
atacaatcta ccacctacag ccaagtggca 
ctggtttgac tggacggaaa atcattgtgg 
gaggtgcctt ttcaggaaag gattatacca 
gttgggtggc aaaatccctt gttaaaggag 
cttatgctat tggagtttct catccattat 
agaagagtga gagagagcta ttagagattg 
tcattgtcag ggatctggat ctgaagaagc 
actttggtag ggacagcttc ccatgggaag 
agcctttttt ccccagactt gtt 



ccttcgtaag gccacttccg cacaccgaca 60 
acgaggcgtt catcgaggag ggcacattcc 120 
acccagataa gatttgtgac caaatcagtg 180 
atcctgatgc caaagtagct tgtgaaactg 240 
gggaaattac atccagagct gctgttgact 300 
acattggata tgatgattct tccaaaggtt 360 
ccttggagca acagtcacca gatattgctc 420 
acattggtgc tggagaccag ggcttaatgt 480 
gtatgccttt aaccattgtc ttggcacaca 540 
gtaatggcac tttgccttgg ttacgccctg 600 
tgcaggatcg aggtgctgtg cttcccatca 660 
atgatgaaga ggtttgtctt gatgaaatga 720 
cagttgtgcc tgcgaaatac cttgatgagg 780 
gatttgttat tggtgggcct cagggtgatg 840 
acacttatgg cggttggggt gctcatggag 900 
aggtcgaccg ttcagctgct tatgctgctc 960 
gtctgtgccg gagggttctt gttcaggtct 1020 
ctatctccat tttccattat ggtacctctc 1080 
tgaagaagaa tttcgatctc cgccctgggg 1140 
caatttatca gaggactgca gcctatggcc 1200 
tgcccaaaaa gcttaaatat tgaaagtgtt 1260 

1283 
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Ala Asp lie Val lie Ser Ala Ala Gly lie Pro Asn Leu lie Thr Ala 
245 250 255 

Asp Met He Lys Glu Gly Ala Ala Val lie Asp Val Gly He Asn Arg 
260 265 270 

Val His Asp Pro Val Thr Ala Lys Pro Lys Leu Val Gly Asp Val Asp 
275 280 285 

Phe Glu Gly Val Arg Gin Lys Ala Gly Tyr lie thr Pro Val Pro Gly 
290 295 300 

Gly Val Gly Pro Met Thr Val Ala Met Leu Met Lys Asn Thr He He 
305 310 315 ■ 320 

Ala Ala Lys Lys Val Leu Arg Leu Glu Glu Arg Glu Val Leu Lys Ser 
325 330 335 

Lys Glu Leu Gly Val Ala Thr Asn 
340 



<210> 22 

<211> 1283 

<212> DNA 

<213> Homo sapiens 

<400> 22 

tttcgcagcc gctgccgcct cgccgctgct 
ccaacatgaa cggacagctc aacggcttcc 
ttttcacctc agagtcggtc ggggaaggcc 
atgctgtcct tgatgcccac cttcagcagg 
ttgctaaaac tggaatgatc cttcttgctg 
accagaaagt ggttcgtgaa gctgttaaac 
ttgactacaa gacttgtaac gtgctggtag 
aaggtgttca tcttgacaga aatgaagaag 
ttggctatgc cactgatgaa actgaggagt 
agctaaatgc caaactggca gaactacgcc 
attctaaaac tcaagttact gtgcagtata 
gagtccacac aattgttata tctgttcagc 
gggatgccct aaaggagaaa gtcatcaaag 
atacaatcta ccacctacag ccaagtggca 
ctggtttgac tggacggaaa atcattgtgg 
gaggtgcctt ttcaggaaag gattatacca 
gttgggtggc aaaatccctt gttaaaggag 
cttatgctat tggagtttct catccattat 
agaagagtga gagagagcta ttagagattg 
tcattgtcag ggatctggat ctgaagaagc 
actttggtag ggacagcttc ccatgggaag 
agcctttttt ccccagactt gtt 



ccttcgtaag gccacttccg cacaccgaca 60 
acgaggcgtt catcgaggag ggcacattcc 120 
acccagataa gatttgtgac caaatcagtg 180 
atcctgatgc caaagtagct tgtgaaactg 240 
gggaaattac atccagagct gctgttgact 300 
acattggata tgatgattct tccaaaggtt 360 
ccttggagca acagtcacca gatattgctc 420 
acattggtgc tggagaccag ggcttaatgt 480 
gtatgccttt aaccattgtc ttggcacaca 540 
gtaatggcac tttgccttgg ttacgccctg 600 
tgcaggatcg aggtgctgtg cttcccatca 660 
atgatgaaga ggtttgtctt gatgaaatga 720 
cagttgtgcc tgcgaaatac cttgatgagg 780 
gatttgttat tggtgggcct cagggtgatg 840 
acacttatgg cggttggggt gctcatggag 900 
aggtcgaccg ttcagctgct tatgctgctc 960 
gtctgtgccg gagggttctt gttcaggtct 1020 
ctatctccat tttccattat ggtacctctc 1080 
tgaagaagaa tttcgatctc cgccctgggg 1140 
caatttatca gaggactgca gcctatggcc 1200 
tgcccaaaaa gcttaaatat tgaaagtgtt 1260 

1283 
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<210> 23 
<211> 3259 
<212> DNA 

<213> Homo sapiens 
<400> 23 

caaggttggt ggaagtcgcg ttgtgcaggt tcgtgcccgg ctggcgcggc gtggtttcac 60 
tgttacatgc cttgaagtga tgaggaggtt tctgttacta tatgctacac agcagggaca 120 
ggcaaaggcc atcgcagaag aaatgtgtga gcaagctgtg gtacatggat tttctgcaga 180 
tcttcactgt attagtgaat ccgataagta tgacctaaaa accgaaacag ctcctcttgt 240 
tgttgtggtt tctaccacgg gcaccggaga cccacccgac acagcccgca agtttgttaa 300 
ggaaatacag aaccaaacac tgccggttga tttctttgct cacctgcggt atgggttact 360 
gggtctcggt gattcagaat acacctactt ttgcaatggg gggaagataa ttgataaacg 420 
acttcaagag cttggagccc ggcatttcta tgacactgga catgcagatg actgtgtagg 480 
tttagaactt gtggttgagc cgtggattgc tggactctgg ccagccctca gaaagcattt 540 
taggtcaagc agaggacaag aggagataag tggcgcactc ccggtggcat cacctgcatc 600 
cttgaggaca gaccttgtga agtcagagct gctacacatt gaatctcaag tcgagcttct 660 
gagattcgat gattcaggaa gaaaggattc tgaggttttg aagcaaaatg cagtgaacag .720 . 
caaccaatcc aatgttgtaa ttgaagactt tgagtcctca cttacccgtt cggtaccccc 780 
actctcacaa gcctctctga atattcctgg tttaccccca gaatatttac aggtacatct 840 
gcaggagtct cttggccagg aggaaagcca agtatctgtg acttcagcag atccagtttt 900 
tcaagtgcca atttcaaagg cagttcaact tactacgaat gatgccataa aaaccactct 960 
gctggtagaa ttggacattt caaatacaga cttttcctat cagcctggag atgccttcag 1020 
cgtgatctgc cctaacagtg attctgaggt acaaagccta ctccaaagac tgcagcttga 1080 
agataaaaga gagcactgcg tccttttgaa aataaaggca gacacaaaga agaaaggagc 1140 
taccttaccc cagcatatac ctgcgggatg ttctctccag ttcattttta cctggtgtct 1200 
tgaaatccga gcaattccta aaaaggcatt tttgcgagcc cttgtggact ataccagtga 1260 
cagtgctgaa aagcgcaggc tacaggagct gtgcagtaaa caaggggcag ccgattatag 1320 
ccgctttgta cgagatgcct gtgcctgctt gttggatctc ctcctcgctt tcccttcttg 1380 
ccagccacca ctcagtctcc tgctcgaaca tcttcctaaa cttcaaccca gaccatattc 1440 
gtgtgcaagc tcaagtttat ttcacccagg aaagctccat tttgtcttca acattgtgga 1500 
atttctgtct actgccacaa cagaggttct gcggaaggga gtatgtacag gctggctggc 1560 
cttgttggtt gcttcagttc ttcagccaaa catacatgca tcccatgaag acagcgggaa 1620 
agccctggct cctaagatat ccatctctcc tcgaacaaca aattctttcc acttaccaga 1680 
tgacccctca atccccatca taatggtggg tccaggaacc ggcatagccc cgtttattgg 1740 
gttcctacaa catagagaga aactccaaga acaacaccca gatggaaatt ttggagcaat 1800 
gtggttgttt tttggctgca ggcataagga tagggattat ctattcagaa aagagctcag 1860 
acatttcctt aagcatggga tcttaactca tctaaaggtt tccttctcaa gagatgctcc 1920 
tgttggggag gaggaagccc cagcaaagta tgtacaagac aacatccagc ttcatggcca 1980 
gcaggtggcg agaatcctcc tccaggagaa cggccatatt tatgtgtgtg gagatgcaaa 2040 
gaatatggcc aaggatgtac atgatgccct tgtgcaaata ataagcaaag aggttggagt 2100 
tgaaaaacta gaagcaatga aaaccctggc cactttaaaa gaagaaaaac gctaccttca 2160 
ggatatttgg tcataaaacc agaaattaaa gaaagaggat taagcttttt tgactgaaag 2220 
tactaaaagt cagctttact agtgccaaac ctttaaattt tcaaaagaaa attttctttc 2280 
aacatttctt gaaggacatg gagtggagat tggatcattt aacaatataa caaaacttcc 2340 
tgatttgatt ttacgtatct tctatctacg cccttcctgt gcctgtgact ctccccaaat 2400 
tgccctgttg ccttgagctc ttctgagcta aaggcagcct tcagtcccta tcagcgcctc 2460 
ctttacttcc cagagaactt cacagagact ctgtccttcc atgcaaaggc ttcctgaaat 2520 
aggggagact gactgagtag ctcattcttg tgacttacag tgccaacatt taaaaaagta 2580 
tgaaaatgat ttatttttat atgatgtata cccataaaga atgctcatat taatgtactt 2640 
aaattacaca tgtagagcat atctgttata tgtttatgta actatcaaat ggttatttgt 2700 
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<210> 23 
<211> 3259 
<212> DNA 

<213> Homo sapiens 
<400> 23 

caaggttggt ggaagtcgcg ttgtgcaggt tcgtgcccgg ctggcgcggc gtggtttcac 60 
tgttacatgc cttgaagtga tgaggaggtt tctgttacta tatgctacac agcagggaca 120 
ggcaaaggcc atcgcagaag aaatgtgtga gcaagctgtg gtacatggat tttctgcaga 180 
tcttcactgt attagtgaat ccgataagta tgacctaaaa accgaaacag ctcctcttgt 240 
tgttgtggtt tctaccacgg gcaccggaga cccacccgac acagcccgca agtttgttaa 300 
ggaaatacag aaccaaacac tgccggttga tttctttgct cacctgcggt atgggttact 3 60 
gggtctcggt gattcagaat acacctactt ttgcaatggg gggaagataa ttgataaacg 420 
acttcaagag cttggagccc ggcatttcta tgacactgga catgcagatg actgtgtagg 480 
tttagaactt gtggttgagc cgtggattgc tggactctgg ccagccctca gaaagcattt 540 
taggtcaagc agaggacaag aggagataag tggcgcactc ccggtggcat cacctgcatc 600 
cttgaggaca gaccttgtga agtcagagct gctacacatt gaatctcaag tcgagcttct 660 
gagattcgat gattcaggaa gaaaggattc tgaggttttg aagcaaaatg cagtgaacag 720 
caaccaatcc aatgttgtaa ttgaagactt tgagtcctca cttacccgtt cggtaccccc 780 
actctcacaa gcctctctga atattcctgg tttaccccca gaatatttac aggtacatct 840 
gcaggagtct cttggccagg aggaaagcca agtatctgtg acttcagcag atccagtttt 900 
tcaagtgcca atttcaaagg cagttcaact tactacgaat gatgccataa aaaccactct 960 
gctggtagaa ttggacattt caaatacaga cttttcctat cagcctggag atgccttcag 1020 
cgtgatctgc cctaacagtg attctgaggt acaaagccta ctecaaagac tgcagcttga 1080 
agataaaaga gagcactgcg tccttttgaa aataaaggca gacacaaaga agaaaggagc 1140 
taccttaccc cagcatatac ctgcgggatg ttctctccag ttcattttta cctggtgtct 1200 
tgaaatccga gcaattccta .aaaaggcatt tttgcgagcc cttgtggact ataccagtga 1260 
cagtgctgaa aagcgcaggc tacaggagct gtgcagtaaa caaggggcag ccgattatag 1320 
ccgctttgta cgagatgcct gtgcctgctt gttggatctc ctcctcgctt tcccttcttg 1380 
ccagccacca ctcagtctcc tgctcgaaca tcttcctaaa cttcaaccca gaccatattc 1440 
gtgtgcaagc tcaagtttat ttcacccagg aaagctccat tttgtcttca acattgtgga 1500 
atttctgtct actgccacaa cagaggttct gcggaaggga gtatgtacag gctggctggc 1560 
cttgttggtt gcttcagttc ttcagccaaa catacatgca tcccatgaag acagcgggaa 1620 
agccctggct cctaagatat ccatctctcc tcgaacaaca aattctttcc acttaccaga 1680 
tgacccctca atccccatca taatggtggg tccaggaacc ggcatagccc cgtttattgg 1740 
gttcctacaa catagagaga aactccaaga acaacaccca gatggaaatt ttggagcaat 1800 
gtggttgttt tttggctgca ggcataagga tagggattat ctattcagaa aagagctcag 1860 
acatttcctt aagcatggga tcttaactca tctaaaggtt tccttctcaa gagatgctcc 1920 
tgttggggag gaggaagccc cagcaaagta tgtacaagac aacatccagc ttcatggcca 1980 
gcaggtggcg agaatcctcc tccaggagaa cggccatatt tatgtgtgtg gagatgcaaa 2040 
gaatatggcc aaggatgtac atgatgccct tgtgcaaata ataagcaaag aggttggagt 2100 
tgaaaaacta gaagcaatga aaaccctggc cactttaaaa gaagaaaaac gctaccttca 2160 
ggatatttgg tcataaaacc agaaattaaa gaaagaggat taagcttttt tgactgaaag 2220 
tactaaaagt cagctttact agtgccaaac ctttaaattt tcaaaagaaa attttctttc 2280 
aacatttctt gaaggacatg gagtggagat tggatcattt aacaatataa caaaacttcc 2340 
tgatttgatt ttacgtatct tctatctacg cccttcctgt gcctgtgact ctccccaaat 2400 
tgccctgttg ccttgagctc ttctgagcta aaggcagcct tcagtcccta tcagcgcctc 2460 
ctttacttcc cagagaactt cacagagact ctgtccttcc atgcaaaggc ttcctgaaat 2520 
aggggagact gactgagtag ctcattcttg tgacttacag tgccaacatt taaaaaagta 2580 
tgaaaatgat ttatttttat atgatgtata cccataaaga atgctcatat taatgtactt 2640 
aaattacaca tgtagagcat atctgttata tgtttatgta actatcaaat ggttatttgt 2700 
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tactaaagct atatttctga taaaaaatat 
ttttatgatg ctgggaaata tgaaatgtat 
atctatcacc attacttttt tttaagtcac 
atttacaggt accagtacgt acatatttta 
ccttttattt ctgctgcttg gcacattttt 
accactcaag cagtgtgctg gacctaaaat 
agattcccca gtgtctaatt ccctgttata 
aatctttctc cactgttcta atatatattg 
catctctgtt gaaggctttt gatccttttg 
tcttaaaaaa aaaaaaaaa 



tttaggataa ttgcctacag agggatttat 2760 
tttaaaattt cactctgggc atatggattt 2820 
aatttcagaa ttttgggaca tttgcattca 2880 
atagaaagat acaacctttt tattttcact 2940 
gagttttccc acattatttg tctccatgat 3000 
actgacttta gttagtatcc ttggattttt 3060 
atttgcacaa acaaaacaaa atgttatgat 3120 
tatttttatt tgatagcttg ggatttaaaa 3180 
agaaataaag atctgaaaga aatggcataa 3240 

3259 



<210> 24 

<211> 1805 

<212> DNA 

<213> Homo sapiens 



<400> 24 

aagagactga actgtatctg cctctatttc 
caaagccggg aaaattttat tagtcctttt 
aaaaaaaaaa ggaacctgaa ctttagtaac 
gcggcgggag aagaggttta atttagttga 
tcacggtgat ggaagctgca cattttttcg 
tctcccggca gcagcccgac gcaaaccaag 
ctgagtggga catacttttg aaggatgtgc 
acaagcagga agcttatgta ctcagtgaga 
ttttgaagac atgtggtacc accctcttgc 
ctagggatta cagtgggttt gactcaattc 
tgaagccttc tcaccaaggg tacccacacc 
atgcaatttt cccaaatgga gcaggatatt 
acttatatac tctggatttc ccagagagtc 
aaattctgat gagtgagctt gacccagcag 
ttactgcaaa ggatgtcact cgtgagagtg 
ttgatgccac aatgttcaat ccttgtgggt 
cttattggac tattcacatc actccagaac 
acttaagtca gacctcctat gatgacctga 
gaaaatttgt gaccaccttg tttgttaatc 
cgccccagaa gattgaaggt tttaagcgtc 
acaattttgt ttttaccagt tttgctaaga 
aaatgaagaa aaaacgcaaa aagagaacac 
tcgatgctgg gggcagtgct ttccataacc 
gtaatgatag tgtaatcatt ttgaattgta 
tgcatgaatg ctctcttctg tgtttaggta 
tggatgtaga aaaaaccttt tactatatga 
tttggtttat gcacagtgta atatttctcc 
gctttcgtcc tcattaggtg ttggcctcag 
gctgccagaa ttttacatcc agttacctcc 
attgaaacca atttctactt catactgatg 
ccatg 



caaaagactc acgttcaact ttcgctcaca 60 
tttaaaaaaa gttaatataa aattatagca 120 
acagctggaa caatcgcagc ggcggcggca 180, 
ttttctgtgg ttgttggttg ttcgctagtc 240 
aagggaccga gaagctgctg gaggtttggt 300, 
gatctgggga tcttcgcact atcccaagat 360 
aatgttcaat cataagtgtg acaaaaactg 420 
gtagcatgtt tgtctccaag agacgtttca 480 
tgaaagcact ggttcccctg ttgaagcttg 540 
aaagcttctt ttattctcgt aagaatttca 600 
ggaatttcca ggaagaaata gagtttctta 660 
gtatgggacg tatgaattct gactgttggt 720 
gggtaatcag tcagccagat caaaccttgg 780 
ttatggacca gttctacatg aaagatggtg 840 
gaattcgtga cctgatacca ggttctgtca 900 
attcgatgaa tggaatgaaa tcggatggaa 960 
cagaattttc ttatgttagc tttgaaacaa 1020 
tcaggaaagt tgtagaagtc ttcaagccag 1080 
agagttctaa atgtcgcaca gtgcttgctt 1140 
ttgattgcca gagtgctatg ttcaatgatt 1200 
agcagcaaca acagcagagt tgattaagaa 1260 
atgtagaagg tggtggatgc tttctagatg 1320 
accactgtgt agttgcagaa agccctagat 1380 
tgcattatta tatcaaggag ttagatatct 1440 
ttctctgcca ctcttgctgt gaaattgaag 1500 
aactttacaa cacttgtgaa agcaactcaa 1560 
aagtatcatc caaaattccc cacagacaag 1620 
cctaaccctc taggactgtt ctattaaatt 1680 
actttctaga acatattctt tactaatgtt 1740 
tttttggaaa cagcaattaa agtttttctt 1800 

1805 



<210> 25 
<211> 254 
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tactaaagct atatttctga taaaaaatat 
ttttatgatg ctgggaaata tgaaatgtat 
atctatcacc attacttttt tttaagtcac 
atttacaggt accagtacgt acatatttta 
ccttttattt ctgctgcttg gcacattttt 
accactcaag cagtgtgctg gacctaaaat 
agattcccca gtgtctaatt ccctgttata 
aatctttctc cactgttcta atatatattg 
catctctgtt gaaggctttt gatccttttg 
tcttaaaaaa aaaaaaaaa 



tttaggataa ttgcctacag agggatttat 2760 
tttaaaattt cactctgggc atatggattt 2820 
aatttcagaa ttttgggaca tttgcattca 2880 
atagaaagat acaacctttt tattttcact 2940 
gagttttccc acattatttg tctccatgat 3000 
actgacttta gttagtatcc ttggattttt 3060 
atttgcacaa acaaaacaaa atgttatgat 3120 
tatttttatt tgatagcttg ggatttaaaa 3180 
agaaataaag atctgaaaga aatggcataa 3240 

3259 



<210> 24 

<211> 1805 

<212> DNA 

<213> Homo sapiens 



<400> 24 

aagagactga actgtatctg cctctatttc 
caaagccggg aaaattttat tagtcctttt 
aaaaaaaaaa ggaacctgaa ctttagtaac 
gcggcgggag aagaggttta atttagttga 
tcacggtgat ggaagctgca cattttttcg 
tctcccggca gcagcccgac gcaaaccaag 
ctgagtggga catacttttg aaggatgtgc 
acaagcagga agcttatgta ctcagtgaga 
ttttgaagac atgtggtacc accctcttgc 
ctagggatta cagtgggttt gactcaattc 
tgaagccttc tcaccaaggg tacccacacc 
atgcaatttt cccaaatgga gcaggatatt 
acttatatac tctggatttc ccagagagtc 
aaattctgat gagtgagctt gacccagcag 
ttactgcaaa ggatgtcact cgtgagagtg 
ttgatgccac aatgttcaat ccttgtgggt 
cttattggac tattcacatc. actccagaac 
acttaagtca gacctcctat gatgacctga 
gaaaatttgt gaccaccttg tttgttaatc 
cgccccagaa gattgaaggt tttaagcgtc 
acaattttgt ttttaccagt tttgctaaga 
aaatgaagaa aaaacgcaaa aagagaacac 
tcgatgctgg gggcagtgct ttccataacc 
gtaatgatag tgtaatcatt ttgaattgta 
tgcatgaatg ctctcttctg tgtttaggta 
tggatgtaga aaaaaccttt tactatatga 
tttggtttat gcacagtgta atatttctcc 
gctttcgtcc tcattaggtg ttggcctcag 
gctgccagaa ttttacatcc agttacctcc 
attgaaacca atttctactt catactgatg 
ccatg 



caaaagactc acgttcaact ttcgctcaca 60 
tttaaaaaaa gttaatataa aattatagca 120 
acagctggaa caatcgcagc ggcggcggca 180 
ttttctgtgg ttgttggttg ttcgctagtc 240 
aagggaccga gaagctgctg gaggtttggt 300 
gatctgggga tcttcgcact atcccaagat 360 
aatgttcaat cataagtgtg acaaaaactg 420 
gtagcatgtt tgtctccaag agacgtttca 480 
tgaaagcact ggttcccctg ttgaagcttg 540 
aaagcttctt ttattctcgt aagaatttca 600 
ggaatttcca ggaagaaata gagtttctta 660 
gtatgggacg tatgaattct gactgttggt 720 
gggtaatcag tcagccagat caaaccttgg 780 
ttatggacca gttctacatg aaagatggtg 840 
gaattcgtga cctgatacca ggttctgtca 900 
attcgatgaa tggaatgaaa tcggatggaa 960 
cagaattttc ttatgttagc tttgaaacaa 1020 
tcaggaaagt tgtagaagtc ttcaagccag 1080 
agagttctaa atgtcgcaca gtgcttgctt 1140 
ttgattgcca gagtgctatg ttcaatgatt 1200 
agcagcaaca acagcagagt tgattaagaa 1260 
atgtagaagg tggtggatgc tttctagatg 1320 
accactgtgt agttgcagaa agccctagat 1380 
tgcattatta tatcaaggag ttagatatct 1440 
ttctctgcca ctcttgctgt gaaattgaag 1500 
aactttacaa cacttgtgaa agcaactcaa 1560 
aagtatcatc caaaattccc cacagacaag 1620 
cctaaccctc taggactgtt ctattaaatt 1680 
actttctaga acatattctt tactaatgtt 1740 
tttttggaaa cagcaattaa agtttttctt 1800 

1805 
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<212> PRT 

<213> Homo sapiens 
<400> 25 

Gin Asp lie Leu Val Phe Arg Ser Lys Thr Tyr Gly Asn Val Leu Val 
5 10 15 

Leu Asp Gly val lie Gin Cys Thr Glu Arg Asp Glu Phe Ser Tyr Gin 
20 25 30 

Glu Met lie Ala Asn Leu Pro Leu Cys Ser His Pro Ash Pro Arg Lys 
■ 35 - 40 45 " ■ 

Val Leu lie lie Gly Gly Gly Asp Gly Gly Val Leu Arg Glu Val Val 



60 

Lys His Pro Ser Val Glu Ser Val Val Gin Cys Glu lie Asp Glu Asp 

70 75 



80 



Val lie Gin Val Ser Lys Lys Phe Leu Pro Gly Met Ala lie Gly Tyr 
85 90 95 

Ser Ser Ser Lys Leu Thr Leu His Val Gly Asp Gly Phe Glu Phe Met 
100 • 105 no 

Lys Gin Asn Gin Asp Ala Phe Asp Val lie lie Thr Asp Ser Ser Asp 
115 120 125 

Pro Met Gly Pro Ala Glu Ser Leu Phe Lys Glu Ser Tyr Tyr Gin Leu 

135 140 

Met Lys Thr Ala Leu Lys Glu Asp Gly Val Leu Cys Cys Gin Gly Glu 

150 . 155 160 

Cys Gin Trp Leu His Leu Asp Leu lie Lys Glu Met Arg Gin Phe Cys 
165 170 175 



Gin Ser Leu Phe Pro Val Val Ala Tyr Ala Tyr Cys Thr II 



180 las 



e Pro Thr 
190 



Tyr Pro Ser Gly Gin lie Gly Phe Met Leu Cys Ser Lys Asn Pro Ser 
195 200 205 

Thr Asn Phe Gin Glu Pro Val Gin Pro Leu Thr Gin Gin Gin Val Ala 

215 220 

22? L6U LyS ^ r Asn Ser **P Val His Arg Ala Ala Phe 

230 23 5 240 

Val Leu Pro Glu Phe Ala Arg Lys Ala Leu Asn Asp Val Ser 
245 250 
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<212> PRT 

<213> Homo sapiens 

<400> 25 

Gin Asp lie Leu Val Phe Arg Ser Lys Thr Tyr Gly Asn Val Leu Val 
•1 5 10 15 

Leu Asp Gly Val He Gin Cys Thr Glu Arg Asp Glu Phe Ser Tyr Gin 
20 25 30 

Glu Met He Ala Asn Leu Pro Leu Cys Ser His Pro Asn Pro Arg Lys 
35 40 45 

Val Leu He He Gly Gly Gly Asp Gly Gly Val Leu Arg Glu Val Val , 
50 55 60 

Lys His Pro Ser Val Glu Ser Val Val Gin Cys Glu He Asp Glu Asp 
65 70 75 80 

Val He Gin Val Ser Lys Lys Phe Leu Pro Gly Met Ala He Gly Tyr 
85 90 95 

Ser Ser Ser Lys Leu Thr Leu His Val Gly Asp Gly Phe Glu Phe Met 
100 105 HO 

Lys Gin Asn Gin Asp Ala Phe Asp Val He lie Thr Asp Ser Ser Asp 
115 120 125 

Pro Met Gly Pro Ala Glu Ser Leu Phe Lys Glu Ser Tyr Tyr Gin Leu 
130 135 140 

Met Lys Thr Ala Leu Lys Glu Asp Gly Val Leu Cys Cys Gin Gly Glu 
145 150 155 160 

Cys Gin Trp Leu His Leu Asp Leu He Lys Glu Met Arg Gin Phe Cys 
165 170 175 

Gin Ser Leu Phe Pro Val Val Ala Tyr Ala Tyr Cys Thr He Pro Thr 
180 185 190 

Tyr Pro Ser Gly Gin He Gly Phe Met Leu Cys Ser Lys Asn Pro Ser 
195 200 205 

Thr Asn Phe Gin Glu Pro Val Gin Pro Leu Thr Gin Gin Gin Val Ala 
210 215 220 

Gin Met Gin Leu Lys Tyr Tyr Asn Ser Asp Val His Arg Ala Ala Phe 
225 230 235 240 

Val Leu Pro Glu Phe Ala Arg Lys Ala Leu Asn Asp Val Ser 
245 250 
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<210> 26 
<211> 2211 
<212> DNA 

<213> Homo sapiens 



<400> 26 

tgcSacaa laS"^ C « 9tttCCa tcacg.gtgc cgccagcatg tctgacaaac 60 
ctoSaS ag * CgCCgaC ^tcggcctgg ctgcctgggg acgcaaggcc ctggacattg 120 
tllll™ g ^atgccgggc ctgatgcgta tgcgggagcg gtactcggcc tccaagccac 180 
tgaagggcgc ccgcatcgct ggctgcctgc acatgaccgt ggagacggec gtcctcatta 240 
agaccctcgt caccctgggt gctgaggtgc agtggtccag XgJaSSS ttctccacc? 300 
llt^alT, ^T^ 0 attgccaagg ctggcattcc gg^gtatgcc tggaagggcg ■ 3 60 
= 9 ggagtacctg tggtgcattg agcagaccct gtacttcaag ■ gacggSc 420 
tcaacatgat tctggacgac gggggcgacc tcaccaacct catccacace aagScccgc 480 
agcttctgcc aggcatccga ggcatctctg aggagaccac gactggggtc caSJcctct ill 
acaagatgat ggccaatggg atcctcaagg tgcctgccat LatgKaat gactccgtca 600 

SEa Saataa^ ^"^^ ^^gagtc cctcatagat ggcatcfagc 6 0 ' 
J ! ^ tgtgatgatt gccggcaagg tagcggtggt agcaggctat ggtgatgtgg 720 
gcaagggctg tgcccaggcc ctgcggggtt tcggagcccg cgtcatcatc Iccaaalttl 780 
cctatcaaoa gCtgCCatgg ^ggccatga ggtgaccacc atggatgagg 84 

I ™ g <™ g SSS2 ».SKs! Z - 
pcgc ~s S£: SSS3 S2ES a ™ Sfi 

gggccacccc agcttcgtga tgagtaactc cttcaccaac caggtgatg^ Seagate- 1260 
C3tCCagaca ^taccccgt tggggttcat ttStJccS agaagcSa S 
tgaggcagtg gctgaagccc acctgggcaa gctgaatgtg aagttgacca agctaacSa 1380 
gaagcaagcc cagtacctgg gcatgtcctg tgatggcccc ttcaagccgg aScSg lllo 
ct a cS^ 3g CCag9tCtgC tfWcaccct ccagctgctg tccttgccca ggccccacct 1500 
ctcctcccta agagctaatg gcaccaactt tgtgattggt ttgtcagtgt cccccatS n 
ctctctgggg ctgatcactt agtttttggc ctctgctgL gccgSac tgttccaSt 
gtggcagcgg gaacagagta ccctcttcaa gccc^ggLa ^ggaggt cccagccaca 1680 
gggaaccatg agctcagtgg tcttggaaca gctcactaag tcagtcct?c cttaacctao llll 
aagtcagtag tggagtcaca aagcccatgt gttttgcca? ctaggStc acctggtS 1800 

££££ ataSaS MtS ^ J! 

S= SSSS SSS g :ScS is; 

taaaaSf ttagaataga ctgttaaggg gcaactgaga aagaacagag aagtgacagc 2040 
acSSS 3 ga " ggccag "aaacatga atgcaggcag atttcgtgaa atctgccalc 2100 
lit, " aaC cagatg 9" c ctttcacaac cctgggtcaa aaagagaata atttggccta 2160 
taatgttaaa agaaagcagg aaggtgggta aataaaaatc ttggtgcctg g"" 0 " 3 ™ 



<210> 27 

<211> 2436 

<212> DNA 

<213> Homo sapiens 
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<210> 26 
<211> 2211 
<212> DNA 

<213> Homo sapiens 
<400> 26 



ssEr sss ~- -~ ssss ssss 

E E IF- SSS SSSSS S2S2S ~ 

=3 SEE S iiil = — s - ' 

ccaagagcaa gtttgacaac ctctatggct gSSSaSc rSS ^ 9 act ccgtca 600 
cctgtLgg: ggg'aStc SatcaS aS " Ctat9a ^tgaccacc atgga^gagg 840 - 

«S = ~ S»:SE3S Si 

gga^gatgc? "SgJgtgta aeattggafa JtSSS ^T'" 9 1020 

sass S2ss a£ ™* szss 

gggcca'ccc agcttcgtga tgSaac?* 2??Sf 9 9tCaaCCt ^ ^tgtgccat 1200 
gctgcggacc catccagala agSccccgt tgSttcat J!*?" 1 ** ^"gatcga 1260 

SSK SS3S ™ ~ ™> SS" 

ctactgagag ccaggtcS atSa^? t9at " cccc "caagccgg atcactaccg 1440 

S -~ ~< £E£ i.15. 

HStcwS S2 a »S"t« «t«g=ct„ 1,40 

HI ES Sli Pf 2TS SS s 
EE SiS =? i= =i SS ~ 

ssss 2sse ssas ss3ss r"-~" s; 

<210> 27 

<211> 2436 

<212> DNA 

<213> Homo sapiens 
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<400> 27 

cgaccacctg tctggacacc acaaagatgc cacccgttgg gggcaaaaag gccaagaagg 60 
gcatcctaga acgtttaaat gctggagaga ttgtgattgg agatggaggg tttgtctttg 120 
cactggagaa gaggggctac gtaaaggcag gaccctggac tcctgaagct gctgtggagc 180 
acccagaagc agttcgccag cttcatcgag agttcctcag agctggctca aacgtcatgc 240 
agaccttcac cttctatgcg agtgaagaca agctggagaa caggggcaac tatgtcttag 300 
agaagatatc tgggcaggaa gtcaatgaag ctgcttgcga catcgcccga caagtggctg 360 
atgaaggaga tgctttggta gcaggaggag tgagtcagac accttcatac cttagctgca 420 
agagtgaaac tgaagtcaaa aaagtatttc tgcaacagtt agaggtcttt atgaagaaga 480 
acgtggactt cttgattgca gagtattttg aacacgttga agaagctgtg tgggcagttg 540 
aaaccttgat agcatccggt aaacctgtgg cagcaaccat gtgcattggc ccagaaggag 600 
atttgcatgg cgtgcccccc ggcgagtgtg cagtgcgcct ggtgaaagca ggagcatcca 660 
tcattggtgt gaactgccac tttgacccca ccattagttt aaaaacagtg aagctcatga 720 
aggagggctt ggaggctgcc caactgaaag ctcacctgat gagccagccc ttggcttacc 780 
acactcctga ctgcaacaag cagggattca .tcgatctccc agaattccca tttggactgg 840 
aacccagagt tgccaccaga tgggatattc aaaaatacgc cagagaggcc tacaacctgg 900 
gggtcaggta cattggcggg tgctgtggat ttgagcccta ccacatcagg gcaattgcag 960 
aggagctggc cccagaaagg ggctttttgc caccagcttc agaaaaacat ggcagctggg 1020 
gaagtggttt ggacatgcac accaaaccct gggttagagc aagggccagg aaggaatact 1080 
gggagaatct tcggatagcc tcaggccggc catacaaccc ttcaatgtca aagccagatg 1140 
gctggggagt gaccaaagga acagccgagc tgatgcagca gaaagaagcc acaactgagc 1200 
agcagctgaa agagctcttt gaaaaacaaa aattcaaatc acagtagcct cgatagaagc 1260 
tatttttgat gaatttctag gtgtttgggt cacagttcct acaaatacgg aaaagggggt 1320 
taaaaagcag tgctttcatg aatgccatcc tacacatatt attgctatta cctgaacaaa 1380 
atagaattac aaatagcact tgataatttt aaagtatgtt ttagaaattt tcttaggagc 1440 
aaaataagta caaagtaaat cttgaacagg ttcactaagc acccaccctg tgaaaagtat 1500 
tatggaaatc actgcagcac aggaaaagta attcagatgt taatgccact tgaagaagtt 1560 
ggtaggctag caaagaggat gagacatgaa ctgtcataaa ggactcagca accagccagg 1620 
gacagataaa gcgctatgga aaggggcttc caagttcttt tgaacatgac ccttagtaac 1680 
aaacacaatt tatataatga cccagcaaaa cacatcacat cttactgtcg aaattaaatg 1740 
tgtgatccat cctagtattt tctgttccat tccttttcat tctatttcat ttataaaaca 1800 
tgctagttga gacttttcaa atggattttt atgacccact actgggtttg gatccacagt 1860 
ttgaaaaata ttgctacaag acacttaagg agaccatcct gtttaagttt attcttataa 1920 
gtaggtcagt catatgagac ctgatcaata aatatccaat acccagagtc ctgctctcag 1980 
agttcttctg tttcgtgacc cacttttcta ccagtaaaag acatagacca atggggagga 2040 
ggggaggaga gatggatatt tcagccctct ccatcctagt caacactgga tccacctagt 2100 
gcctctgggc cataaggctg agcagagtga gcttgtatta gttggtagct tttaaaaaat 2160 
ataataaaaa aaaagtagag attctccaaa ctctagcctg gtttcctaga ttgagaacta 2220 
tgatattttt ctctgataat ttaatatcta ctctcctaca aaagctcaag cctgaagata 2280 
caagactatt agaagaaaca tgactaccct cagtgtatta gaaaagaggt catgcagctt 2340 
tctaaacatt attgaattgt ttgagctgtt ttgaaattgt aattcttttc agctattaaa 2400 
aagaagagca atgagaaaaa aaaaaaaaaa aaaaaa 2436 

<210> 28 

<211> 1326 

<212> DNA 

<213> Homo sapiens 

<400> 28 

ttcttttcct ctcttcttct ttcgcggttc agcatgcagg aaaaagacgc ctcctcacaa 60 
ggtttcctgc cacacttcca acatttcgcc acgcaggcga tccatgtggg ccaggatccg 120 
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<400> 27 

cgaccacctg tctggacacc acaaagatgc cacccgttgg gggcaaaaag gccaagaagg 60 
gcatcctaga acgtttaaat gctggagaga ttgtgattgg agatggaggg tttgtctttg 120 
cactggagaa gaggggctac gtaaaggcag gaccctggac tcctgaagct gctgtggagc 180 
acccagaagc. agttcgccag cttcatcgag agttcctcag agctggctca aacgtcatgc 240 
agaccttcac cttctatgcg agtgaagaca agctggagaa caggggcaac tatgtcttag 300 
agaagatatc tgggcaggaa gtcaatgaag ctgcttgcga catcgcccga caagtggctg 360 
atgaaggaga tgctttggta gcaggaggag tgagtcagac accttcatac cttagctgca 420 
agagtgaaac tgaagtcaaa aaagtatttc tgcaacagtt agaggtcttt atgaagaaga 480 
acgtggactt cttgattgca gagtattttg aacacgttga agaagctgtg tgggcagttg 540 
aaaccttgat agcatccggt aaacctgtgg cagcaaccat gtgcattggc ccagaaggag 600 
atttgcatgg cgtgcccccc ggcgagtgtg cagtgcgcct ggtgaaagca ggagcatcca 660 
tcattggtgt gaactgccac tttgacccca ccattagttt aaaaacagtg aagctcatga 720 
aggagggctt ggaggctgcc caactgaaag ctcacctgat gagccagccc ttggcttacc 780 
acactcctga ctgcaacaag cagggattca tcgatctccc agaattccca tttggactgg 840 
aacccagagt tgccaccaga tgggatattc aaaaatacgc cagagaggcc tacaacctgg 900 
gggtcaggta cattggcggg tgctgtggat ttgagcccta ccacatcagg gcaattgcag 960 
aggagctggc cccagaaagg ggctttttgc caccagcttc agaaaaacat ggcagctggg 1020 
gaagtggttt ggacatgcac accaaaccct gggttagagc aagggccagg aaggaatact 1080 
gggagaatct tcggatagcc tcaggccggc catacaaccc ttcaatgtca aagccagatg 1140 
gctggggagt gaccaaagga acagccgagc tgatgcagca gaaagaagcc acaactgagc 1200 
agcagctgaa agagctcttt gaaaaacaaa aattcaaatc acagtagcct cgatagaagc 1260 
tatttttgat gaatttctag gtgtttgggt cacagttcct acaaatacgg aaaagggggt 1320 
taaaaagcag tgctttcatg aatgccatcc tacacatatt attgctatta cctgaacaaa 1380 
atagaattac aaatagcact tgataatttt aaagtatgtt - ttagaaattt tcttaggagc 1440 
aaaataagta caaagtaaat cttgaacagg ttcactaagc acccaccctg tgaaaagtat 1500 
tatggaaatc actgcagcac aggaaaagta attcagatgt taatgccact tgaagaagtt 1560 
ggtaggctag caaagaggat gagacatgaa ctgtcataaa ggactcagca accagccagg 1620 
gacagataaa gcgctatgga aaggggcttc caagttcttt tgaacatgac ccttagtaac 1680 
aaacacaatt tatataatga cccagcaaaa cacatcacat cttactgtcg aaattaaatg 1740 
tgtgatccat cctagtattt tctgttccat tccttttcat tctatttcat ttataaaaca 1800 
tgctagttga gacttttcaa atggattttt atgacccact actgggtttg gatccacagt 1860 
ttgaaaaata ttgctacaag acacttaagg agaccatcct gtttaagttt attcttataa 1920 
gtaggtcagt catatgagac ctgatcaata aatatccaat acccagagtc ctgctctcag 1980 
agttcttctg tttcgtgacc cacttttcta ccagtaaaag acatagacca atggggagga 2040 
ggggaggaga gatggatatt tcagccctct ccatcctagt caacactgga tccacctagt 2100 
gcctctgggc cataaggctg agcagagtga gcttgtatta gttggtagct tttaaaaaat 2160 
ataataaaaa aaaagtagag attctccaaa ctctagcctg gtttcctaga ttgagaacta 2220 
tgatattttt ctctgataat ttaatatcta ctctcctaca aaagctcaag cctgaagata 22 80 
caagactatt agaagaaaca tgactaccct cagtgtatta gaaaagaggt catgcagctt 2340- 
tctaaacatt attgaattgt ttgagctgtt ttgaaattgt aattcttttc agctattaaa 2400 
aagaagagca atgagaaaaa aaaaaaaaaa aaaaaa 2436 

<210> 28 
<211> 1326 
<212> DNA 

<213> Homo sapiens 
<400> 28 

ttcttttcct ctcttcttct ttcgcggttc agcatgcagg aaaaagacgc ctcctcacaa 60 
ggtttcctgc cacacttcca acatttcgcc acgcaggcga tccatgtggg ccaggatccg 120 
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<210> 29 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :PCR product 
<400> 29 

cccacggtcg gggtacctgg gcgggacgcg ccaggccgac tcccggcga 49 

<210> 30 

<211> 3464 

<212> DNA 

<213> Homo sapiens 

<400> 30 

isiiiiilliii=is:ss: 
Ili iii III lis iii; Isi ;§ 

^ a „ t * at L. t sss; sssss ™ 
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gagcaatgga cctccagggc tgtagtgccc cccatctcac tgtccaccac gttcaagcaa 180 
ggggcgcctg gccagcactc gggttttgaa tatagccgtt ctggaaatcc cactaggaat 240 
tgccttgaaa aagcagtggc agcactggat ggggctaagt actgtttggc ctttgcttca 300 
ggtttagcag ccactgtaac tattacccat cttttaaaag caggagacca aattatttgt 360 
atggatgatg tgtatggagg tacaaacagg tacttcaggc aagtggcatc tgaatttgga 420 
ttaaagattt cttttgttga ttgttccaaa atcaaattac tagaggcagc aattacacca 480 
gaaaccaagc ttgtttggat cgaaaccccc acaaacccca cccagaaggt gattgacatt 540 
gaaggctgtg cacatattgt ccataagcat ggagacatta ttttggtcgt ggataacact 600 
tttatgtcac catatttcca gcgccctttg gctctgggag ctgatatttc tatgtattct 660 
gcaacaaaat acatgaatgg ccacagtgat gttgtaatgg gcctggtgtc tgttaattgt 720 
gaaagccttc ataatagact tcgtttcttg caaaactctc ttggagcagt tccatctcct 780 
attgattgtt acctctgcaa tcgaggtctg aagactctac atgtccgaat ggaaaagcat 840 
ttcaaaaacg gaatggcagt tgcccagttc ctggaatcta atccttgggt agaaaaggtt 900 
atttatcctg ggctgccctc tcatccacag catgagttgg tgaagcgtca gtgtacaggt 960 
tgtacaggga tggtcacctt ttatattaag ggcactcttc agcatgctga gattttcctc 1020 
aagaacctaa agctatttac tctggccgag agcttgggag gattcgaaag ccttgctgag 1080 
cttccggcaa tcatgactca tgcatcagtt cttaagaatg acagagatgt ccttggaatt 1140 
agtgacacac tgattcgact ttctgtgggc ttagaggatg aggaagacct actggaagat 1200 
ctagatcaag ctttgaaggc agcacaccct ccaagtggaa ttcacagcta gtattccaga 1260 
gctgctatta gaagctgctt cctgtgaaga tcaatcttcc tgagtaatta atggaccaac 1320 
aatgag 1326 

<210> 29 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR product 
<400> 29 

cccacggtcg gggtacctgg gcgggacgcg ccaggccgac tcccggcga 49 

<210> 30 

<211> 3464 

<212> DNA 

<213> Homo sapiens 

<400> 30 

tttaatggac acataattta attatatatt ttttcttaca gatacccagg tgttctctct 60 
gatgtccagg aggagaaagg cattaagtac aaatttgaag tatatgagaa gaatgattaa 120 
tatgaaggtg ttttctagtt taagttgttc cccctccctc tgaaaaaagt atgtattttt 180 
acattagaaa aggttttttg ttgactttag atctataatt atttctaagc aactagtttt 240 
tattccccac tactcttgtc tctatcagat accatttatg agacattctt gctataacta 300 
agtgcttctc caagacccca actgagtccc cagcacctgc tacagtgagc tgccattcca 360 
cacccatcac atgtggcact cttgccagtc cttgacattg tcgggctttt cacatgttgg 420 
taatatttat taaagatgaa gatccacata cccttcaact gagcagtttc actagtggaa 480 
ataccaaaag cttcctacgt gtatatccag aggtttgtag ataaatgttg ccaccttgtt 540 
tgtaacagtg aaaaattgaa aacaacctgg aagtccagtg atgggaaaat gagtatgttt 600 
ctgtcttaga ttggggaacc caaagcagat tgcaagactg aaatttcagt gaaagcagtg 660 
tatttgctag gtcataccag aaatcatcaa ttgaggtacg gagaaactga actgagaagg 720 
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taagaaaagc aatttaaagt cagcgagcag gttctcattg ataacaagct ccatactgct 780 

gagatacagg gaaatggagg ggggaaagct ggagtattga tcccgccccc ctccttggtt 840 

gtcagctccc tgtcctgtgt gtgggcggaa catagtccag ctgctctata gcaagtctca 900 

ggtgtttgca gtaagaagct gctggcatgc acgggaacag tgaatgccaa acacttaaag 960 

caattcgatg tttaagtatg taagttcttt tttttttaga cagcgtttcg ctcttgttgc 1020 

ccaggctagc atgcaatggt gtgacctcgg cttactgcaa cctccgcctt cccagattca 1080 

agcgattctc ctgcctcagg ctcccaagta gctaggacca ggtgcgcgcc accacgcccg 1140 

gctaattttt gtattttgta tttttagtag agatggggtt tcaccatgtt ggtcaggcta 1200 

gtctcgaact cgtgaccgca agcgattcac ccacctcagc ctcccaaagt gctgggatta 1260 

ccggcttgag ccaccacacc cggcacatct tcattctttt tatgtagtaa aaagtataag 1320 

gccacacatg gtttatttga agtattttat aatttaaaaa aatacagaag caggaaaacc 1380 

aattataagt tcaagtgagg gatgatggtt gcttgaacca aagggttgca tgtagtaaga 1440 

aattgtgatt taagatatat tttaaagtta taagtagcag gatattctga tggagtttga 1500 

ctttggtttt gggcccaggg agtttcagat gcctttgaga aatgaatgaa gtagagagaa 1560 
aataaaagaa aaaccagcca ggcacagtgg ctcacacctg taatcccagc gctttgggag^ 1620 

gctaaggcag gcagatcact tgagaccagc ttgggcaaca tggcaaagcc ccatctctac 1680 

aaaaaacaca aaaattagct gggcattgtg gcgcacacct gtattcccat ctagtcagga 1740 

agctgagatg gaagaattaa ttgagcccac gagttcaagg ctgcagtgag tcgtgattgt 1800 

gccactgcac tccagccggg gtgacagaag agaccttgtc tcgaaaacga atctgaaaac 1860 

aatggaacca tgccttcata attctagaaa gttattttca actgataaat ctatattcac 1920 

ccaaataatc aagggtgaag gtaaaataat acatttttag acaagcaaag actcaggggt 1980 

tacctccatg tgcccttttt agggaagctg ttggagaaaa tactccagca aaatgaagga 2040 

gtacacaaac cagagaatga catgaatcca gcaaatagga tccaacacag gcaatattcc 2100 
agctatggag ctagctttaa aaaggaacag taaaaatatt aatcggttag ctgggtggaa. 2160 

tggcccatgc ctgtagtccc agctactcag gaggctcagc agcaggacga cttgagccca 2220 

agagttccag accagcctgg ccaccttagt gagatccctt ctcttaaaaa taataactta 2280 

ttgccagatt tggggcattt ggaaagaagt tcattgaaga taaagcaaaa gtaaaaaaaa 2340 

aaaaaaaaaa aacaagggga aagggttggt taggcaatca ttctagggca gaaagaagta 2400 

caggatagga agagcataat acactgtttt tctcaacaag gagcagtatg tacacagtca 2460 

taatgatgtg actgcttagc ccctaaatat ggtaactact ctgggacaat atgggaggaa 2520 

aagtgaagat tgtgatggtg taagagctaa tcctcatctg tcatatccag aaatcactat 2580 

ataatatata ataatgaaat gactaagtta tgtgaggaaa aaaacagaag acattgctaa 2640 

aagagttaaa agtcattgct ctggagaatt aggagggatg gggcagggga ctgttaggat 2700 

gcattataaa ctgaaaagcc tttttaaaat tttatgtatt aatatatgca ttcacttgaa 2760 

aaactaaaaa aaaacaataa tttggaaaaa cccatgaagg taactaacgg aaggaaaaac 2820 

taagagaatg aaaagtattt gcctctggaa agaacaactg gcaggactgt tgttttcatt 2880 

gtaagacttt tggagccatt taattgtact taaccatttt catctatttc tttaataaga 2940 

acaattccat cttaataaag agttacactt gttaataagt gctggcctcc tgttgttctt 3000 

tgtacacccc acacaaaatt tcaaagaaac tttgatggca atatatctcc atggtcagct 3060 

taaaaataga gaaaggaaaa catagaatta gccaagagtc acacaaaaca aagatcagtt 3120 

gtttgttagg aaacaatcaa aatcaagtct cactttttcc agattggctt atggaacagc 3180 

actgtaaggt gataacttgg ggcaaacatg taaataataa aacatatgtt ttaaatattc 3240 

aggttagcac attttatgtt tctgtgagat ■ taaaattgtg tgtgacatac ccgcttcctt 3300 

aaaggcaatg tttctgaaaa tgttgtacct gctattcctg aatcagggat gggtcccaga 33 60 

atctgccttt taaacatctc agataatctg aagcctgctt aagtttgtaa ggcactgctt 3420 

ttgcactcta aggaagaaaa aaacaagttt taattcccgt ctct 3464 

<210> 31 
<211> 1584 
<212> DNA 

<213> Homo sapiens 
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taagaaaagc aatttaaagt cagcgagcag gttctcattg ataacaagct ccatactact 780 
gagatapagg gaaatggagg ggggaaagct ggagtattga tcccgccccc ctcctStt ITa 
S T CCt9tgt gtgg ^gaa catagtccag ctgcLtata gcaagtc^a 900 

SSJ °? a f aagaa9Ct 9 ct **atgc acgggaacag tgaatgccaa acaSaaaq 960 
. caattcgatg tttaagtatg taagttcttt tttttttaga cagcgtttcg ctcttSS? rnL 
^SEJT at9Caat9gt ^gacctcgg cttactgcaa cc?ccgcctt cccagattca lw 
gcSttttt aS^^ ag9 " CCCaagta 0ctagg.cc ggtgcgcgcc accacgcccg 1140 
a^^i I gtattttgta tttttagtag agatggggtt tcaccatgtt ggtcaggcta 1200 

EKZS a -™ —a- 1380° 

: aattgtgK a'agatata? S2"^ 9C " gaaCCa ^"gcl tg^gt^ga LIS 
ct .^ 9 „ 9a " taa S atata t tttaaagtta taagtagcag gatattctga tggagtttqa 1500 
aataaaaaL 99gCCCaggg ^"cagac gcctttgaga aatgaatgaa gtagagagla Ull 
aa ' aaaagaa aaacca 9^ a ggcacagtgg ctcacacctg taatcccagc gct^tggqaa 1620 
gctaaggcag gcagatcact tgagaccagc ttgggcaaca tggcaaagcc ccatcSac lllo 
aaaaaacaca aaaattagct gggcattgtg gcgcacacct gSttcccat ctaotcagg^ l^o 
agctgagatg gaagaattaa ttgagcccac gagttcaagg ctgcagtgag tcglSSt Hqo 

aatgg aS £c3SS EST'** tcgaaaa ^ a StSaaK XSo 

^llltt ° tsccttcata attetagaaa gttattttca actgataaat ctatattcac i<J:>n 
^ aaataa * C a£ Wgaag gtaaaataat acatttttag acaagcaaag actcaqaqqt 19&0 
gtacacaaa? £T v" agggaagctg "ggagaaaa tactccagc aaatgaaS 2040 
gSS SSSS ™ag =2SS aSggS fig 

Si SSI ™ ssss == sSss 

ccgccagatt tggggcattt ggaaagaagt tcattgaaga taaagcaaaa ataaa aaaaa 

iilipiiiiiiiiii 

aagagttaaa a"«2??^£ gaCtaagtta ^gaggaaa aaaacagaag acattgctaa 2640 

■ASK 2SSS EKE E2S2 -~ -° 

-SgL^g StSt Sl""^ CCC3tgaagg Caactaa ^ 282 
gtaagac^ ' ?™? SSS SSSS SaaSga SIX 

^a S SS tc 9ttaCaCtt 9ttaataa9t gctggC -- S"t a c? 
taaa! 9 ^ acacaaaa tt tcaaagaaac tttgatggca atatatctcc atggtcagct 3060 

otJtat^oo 9 " aaaa cata 9 aatta gccaagagtc acacaaaaca aagatcagtt 3120 

=i ~S « =35S ~ I 

: ~ -™ ssssss 

tSSSS ESSE £r*" — « •«* 

<210> 31 
<211> 1584 
<212> DNA 

<213> Homo sapiens 
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<400> 31 

cggggcagct ctgaggaaca aggtggaagc tcagagcgct ggtctccacc ctggtgcccc 60 
tgggctggtg ctggcagtgg gagccgtggc tgtggatgag agacatagac gagagagtga 120 
gatggcctgg tttgccctct acctcctgag ccttctctgg gctacagctg ggactagtac 180 
ccagacccag agttcatgct ccgttccctc agcacaggag cccttggtca atggaataca 240 
agtactcatg gagaactcgg tgacttcatc agcctaccca aaccccagca tcctgattgc 300 
catgaatctg gccggagcct acaacttgaa ggcccagaag ctcctgactt accagctcat 360 
gtccagcgac aacaacgatc taaccattgg gcacctcggc ctcaccatca tggccctcac 420 
ctcctcctgc cgagaccctg gggataaagt atccattcta caaagacaaa tggagaactg 480 
ggcaccttcc agccccaacg ctgaagcatc agccttctat gggcccagtc tagcgatctt 540 
ggcactgtgc cagaagaact ctgaggcgac cttgccgata gccgtccgct ttgccaagac 600 
cctgctggcc aactcctctc ccttcaatgt agacacagga gcaatggcaa ccttggctct 660 
gacctgtatg tacaacaaga tccctgtagg ttcagaggaa ggttacagat ccctgtttgg 720 
tcaggtacta aaggatattg tggagaaaat cagcatgaag atcaaagata atggcatcat 780 
tggagacatc tacagtactg gcctcgccat gcaggctctc tctgtaacac ctgagccatc 840 
taaaaaggaa tggaactgca agaagactac ggatatgata ctcaatgaga ttaagcaggg 900 
gaaattccac aaccccatgt ccattgctca aatcctccct tccctgaaag gcaagacata .960 
cctagatgtg ccecaggtca cttgtagtcc tgatcatgag gtacaaccaa ctctacccag 1020 
caaccctggc cctggcccca cctctgcatc taacatcact gtcatataca ccataaataa 1080 
ccagctgagg ggggttgagc tgctcttcaa cgagaccatc aatgttagtg tgaaaagtgg 1140 
gtcagtgtta cttgttgtcc tagaggaagc acagcgcaaa aatcctatgt tcaaatttga 1200 
aaccacaatg acatcttggg gccttgtcgt ctcttctatc aacaatatcg cggaaaatgt 1260 
taatcacaag acatactggc agtttcttag tggtgtaaca cctttgaatg aaggggttgc 1320 
tgactacata cccttcaacc acgagcacat cacagccaat ttcacacagt actaacgaag 1380 
aggtgggttc agcttctatc aaacatctcc aaaggatggg tgaaattttt tccacttcat 1440 
tttaaatcta tgcaaaaaag cgaatgcctg tgatgctacc atattcctgg taaaaacatg 1500 
gagaaccact atgtagaata aaaatgcaaa gttcactgga gtctcaacat ctatgactca 1560 
tgaaaataaa attttcatct tctc 1584 

<210> 32 

<211> 1537 

<212> DNA 

<213> Homo sapiens 

<400> 32 

gctctcatta ccttctgccc atcacttaat aaatagccag ccaattcatc aacattctgg 60 
tacactgttg gagagatgag acagtcacac cagctgcccc tagtggggct cttactgttt 120 
tcttttattc caagccaact atgcgagatt tgtgaggtaa gtgaagaaaa ctacatccgc 180 
ctaaaacctc tgttgaatac aatgatccag tcaaactata acaggggaac cagcgctgtc 240 
aatgttgtgt tgtccctcaa acttgttgga atccagatcc aaaccctgat gcaaaagatg 300 
atccaacaaa tcaaatacaa tgtgaaaagc agattgtcag atgtaagctc gggagagctt 360 
gccttgatta tactggcttt gggagtatgt cgtaacgctg aggaaaactt aatatatgat 420 
taccacctga ctgacaagct agaaaataaa ttccaagcag aaattgaaaa tatggaagca 480 
cacaatggca ctcccctgac taactactac cagctcagcc tggacgtttt ggccttgtgt 540 
ctgttcaatg ggaactactc aaccgccgaa gttgtcaacc acttcactcc tgaaaataaa 600 
aactattatt ttggtagcca gttctcagta gatactggtg caatggctgt cctggctctg 660 
acctgtgtga agaagagtct aataaatggg cagatcaaag cagatgaagg cagtttaaag 720 
aacatcagta tttatacaaa gtcactggta gaaaagattc tgtctgagaa aaaagaaaat 780 
ggtctcattg gaaacacatt tagcacagga gaagccatgc aggccctctt tgtatcatca 840 
gactattata atgaaaatga ctggaattgc caacaaactc tgaatacagt gctcacggaa 900 
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<400> 31 

cggggcagct ctgaggaaca aggtggaagc 
tgggctggtg ctggcagtgg gagccgtggc 
gatggcctgg tttgccctct acctcctgag 
ccagacccag agttcatgct ccgttccctc 
agtactcatg gagaactcgg tgacttcatc 
catgaatctg gccggagcct acaacttgaa 
gtccagcgac aacaacgatc taaccattgg 
ctcctcctgc cgagaccctg gggataaagt 
ggcaccttcc agccccaacg ctgaagcatc 
ggcactgtgc cagaagaact ctgaggcgac 
cctgctggcc aactcctctc ccttcaatgt 
gacctgtatg tacaacaaga tccctgtagg 
tcaggtacta aaggatattg tggagaaaat 
tggagacatc tacagtactg gcctcgccat 
taaaaaggaa tggaactgca agaagactac 
gaaattccac aaccccatgt ccattgctca 
cctagatgtg ccccaggtca cttgtagtcc 
caaccctggc cctggcccca cetctgcatc 
ccagctgagg ggggttgagc tgctettcaa 
gtcagtgtta cttgttgtcc tagaggaagc 
aaccacaatg acatcttggg gccttgtcgt 
taatcacaag acatactggc agtttcttag 
tgactacata cccttcaacc acgagcacat 
aggtgggttc agcttctatc aaacatctcc 
tttaaatcta tgcaaaaaag cgaatgcctg 
gagaaccact atgtagaata aaaatgcaaa 
tgaaaataaa attttcatct tctc 



tcagagcgct ggtctccacc ctggtgcccc 60 
tgtggatgag agacatagac gagagagtga 120 
ccttctctgg gctacagctg ggactagtac 180 
agcacaggag cccttggtca atggaataca 240 
agcctaccca aaccccagca tcctgattgc 300 
ggcccagaag ctcctgactt accagctcat 360 
gcacctcggc ctcaccatca tggccctcac 420 
atccattcta caaagacaaa tggagaactg 480 
agccttctat gggcccagtc tagcgatctt 540 
cttgccgata gccgtccgct ttgccaagac 600 
agacacagga gcaatggcaa ccttggctct 660 
ttcagaggaa ggttacagat ccctgtttgg 720 
cagcatgaag atcaaagata atggcatcat 780 
gcaggctctc tctgtaacac ctgagccatc 840 
ggatatgata ctcaatgaga ttaagcaggg 900 
aatcctccct tccctgaaag gcaagacata 960 
tgatcatgag gtacaaccaa ctctacccag 1020 
taacatcact gtcatataca ccataaataa 1080 
cgagaccatc aatgttagtg tgaaaagtgg 1140 
acagcgcaaa aatcctatgt tcaaatttga 1200 
ctcttctatc aacaatatcg cggaaaatgt 1260 
tggtgtaaca cctttgaatg aaggggttgc 1320 
cacagccaat ttcacacagt actaacgaag 1380 
aaaggatggg tgaaattttt tccacttcat 1440 
tgatgctacc atattcctgg taaaaacatg 1500 
gttcactgga gtctcaacat ctatgactca 1560 

1584 



<210> 32 
<211> 1537 
<212> DNA 

<213> Homo sapiens. 



<400> 32 

gctctcatta ccttctgccc atcacttaat 
tacactgttg gagagatgag acagtcacac 
tcttttattc caagccaact atgcgagatt 
ctaaaacctc tgttgaatac aatgatccag 
aatgttgtgt tgtccctcaa acttgttgga 
atccaacaaa tcaaatacaa tgtgaaaagc 
gccttgatta tactggcttt gggagtatgt 
taccacctga ctgacaagct agaaaataaa 
cacaatggca ctcccctgac taactactac 
ctgttcaatg ggaactactc aaccgccgaa 
aactattatt ttggtagcca gttctcagta 
acctgtgtga agaagagtct aataaatggg 
aacatcagta tttatacaaa gtcactggta 
ggtctcattg gaaacacatt tagcacagga 
gactattata atgaaaatga ctggaattgc 



aaatagccag ccaattcatc aacattctgg 60 
cagctgcccc tagtggggct cttactgttt 120 
tgtgaggtaa gtgaagaaaa ctacatccgc 180 
tcaaactata acaggggaac cagcgctgtc 240 
atccagatcc aaaccctgat gcaaaagatg 300 
agattgtcag atgtaagctc gggagagctt 360 
cgtaacgctg aggaaaactt aatatatgat 420 
ttccaagcag aaattgaaaa tatggaagca 480 
cagctcagcc tggacgtttt ggccttgtgt 540 
gttgtcaacc acttcactcc tgaaaataaa 600 
gatactggtg caatggctgt cctggctctg 660 
cagatcaaag cagatgaagg cagtttaaag 720 
gaaaagattc tgtctgagaa aaaagaaaat 780 
gaagccatgc aggccctctt tgtatcatca 840 
caacaaactc tgaatacagt gctcacggaa 900 
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atttctcaag gagcattcag taatccaaac 
ggaaagacct tcttggatat taacaaagac 
aacatctccg ctgatgagcc tataactgtg 
gtcaattact ctgtgagaat caatgaaaca 
tctgtcttcc tcagtgtgat ggagaaagcc 
acaatggagg agcgctcatg ggggccctat 
aataatgaca gaacctactg ggaacttctg 
ggtagttacg ttgtccgcaa tggagaaaac 
gcccaaactt tcctcagctg cataaaatcc 
ttatgccttc ttcttcattt atcccagtac 
tctctacatg ttcaataaaa gttgttgaaa 



gctgcagccc aggtcttacc tgccctgatg 960 
tcttcttgcg tctctgcttc aggtaacttc 1020 
acacctcctg actcacaatc atatatctcc 1080 
tatttcacca atgtcactgt gctaaatggt 1140 
cagaaaatga atgatactat atttggtttc 1200 
atcacctgta ttcagggcct atgtgccaac 1260 
agtggaggcg aaccactgag ccaaggagct 1320 
ttggaggttc gctggagcaa atactaataa 1380 
atttgcagtg gagttccatg tttattgtcc 1440 
gagcaggaga gttaataacc tccccttctc 1500 
gattaac 1537 



<210> 33 

<211> 1866 

<212> DNA 

<213> Homo sapiens 



<400> 33 

ccgattcttg ctcactgctc acccacctgc 
cttccttctg ggggtcctgg gggccctcac 
ccatctggta gagaagttgg gccagcacct 
gcacttgaac cccagcatct atgtgggcct 
ggaagacctc tacctgcaca gcctcaagct 
cttcagcgag gatgacggtg actgccaggg 
cctgctcgct ctcagagcca actgtgagtt 
ctcacagctc aaatggttcc tggaggatga 
ccacccccac actagctact accagtatgg 
gaagcgggtc catgacagcg tggtggacaa 
gggccaccat tctgtggaca cagcagccat 
ctcaaacttc aaccctggtc ggagacaacg 
ggagatcttg aaggcccaga cccccgaggg 
ggcattacag ttcctcatga cttcccccat 
caaggcgagg gttgctttgc tggccagtct 
gatttcccag ctgctgcccg ttctgaacca 
ctgtctggca ccacgagtca tgttggaacc 
gatcatcagt gtcacgctgc aggtgcttag 
tgttctggcc gggtccaccg tggaagatgt 
cacatatgaa acacaggcct cctcgtcagg 
ggccggagaa agggagttct ggcagcttct 
tattgctgac tacagaccca aggatggaga 
gcccctgagc tccctcatcc cagcagcctc 
tgatgtccct ggaacaggaa ctcgcctgac 
atgccccctg ggatcacccc agccacaagc 
tggagcagag agccaagcat cttccctggg 
gccctgcagg tctcccatga aggccacccc 
tccttggcaa aaaacggagt ccgcaggccg 
ttggggtcct gcaagaaggc ctcctcagcc 
actctgctgt tagagtggca gctctgagct 
gcagggctgc tcagtgcctg cctctgacaa 
aaaaaa 



tgctgccatg aggcaccttg gggccttcct 60 

tgagatgtgt gaaataccag agatggacag 120 

cttaccttgg atggaccggc tttccctgga 180 

acgcctctcc agtctgcagg ctgggaccaa 240 

tggttaccag cagtgcctcc tagggtctgc 300 

caagccttcc atgggccagc tggccctcta 360 

tgtcaggggc cacaaggggg acaggctggt 420 

gaagagagcc attgggcatg atcacaaggg 480 

cctgggcatt ctggccctgt gtctccacca 540 

acttctgtat gctgtggaac ctttccacca 600 

ggcaggcttg gcattcacct gtctgaagcg 660 

gatcaccatg gccatcagaa cagtgcgaga 720 

ccactttggg aatgtctaca gcaccccatt 780 

gcctggggca gaactgggaa cagcatgtct 840 

gcaggatgga gccttccaga atgctctcat 900 

caagacctac attgatctga tcttcccaga 960 

agctgctgag accattcctc agacccaaga 1020 

tctcttgccg ccgtacagac agtccatctc 1080 

cctgaagaag gcccatgagt taggaggatt 1140 

cccctactta acctccgtga tggggaaagc 1200 

ccgagacccc aacaccccac tgttgcaagg 1260 

aaccattgag ctgaggctgg ttagctggta 1320 

gcacactccc taggcttcta ccctccctcc 1380 

cctgctgcca cctcctgtgc actttgagca 144 0 

ccttcgaggg ccctatacca tggcccacct 1500 

aagtctttct ggccaagtct ggccagcctg 1560 

atggtctgat gggcatgaag catctcagac 1620 

caggtgttgt gaagaccact cgttctgtgg 1680 

cgggggctat ggccctgacc ccagctctcc 1740 

ggttgtggca cagtagctgg ggagacctca 1800 

aattaaagca ttgatggcct gtggacctgc 1860 

1866 
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atttctcaag gagcattcag taatccaaac 
ggaaagacct tcttggatat taacaaagac 
aacatctccg ctgatgagcc tataactgtg 
gtcaattact ctgtgagaat caatgaaaca 
tctgtcttcc tcagtgtgat ggagaaagcc 
acaatggagg agcgctcatg ggggccctat 
aataatgaca gaacctactg ggaacttctg 
ggtagttacg ttgtccgcaa tggagaaaac 
gcccaaactt tcctcagctg cataaaatcc 
ttatgccttc ttcttcattt atcccagtac 
tctctacatg ttcaataaaa gttgttgaaa 



gctgcagccc aggtcttacc tgccctgatg 960 
tcttcttgcg tctctgcttc aggtaacttc 1020 
acacctcctg actcacaatc atatatctcc 1080 
tatttcacca atgtcactgt gctaaatggt 1140 
cagaaaatga atgatactat atttggtttc 1200 
atcacctgta ttcagggcct atgtgccaac 1260 
agtggaggcg aaccactgag ccaaggagct 1320 
ttggaggttc gctggagcaa atactaataa 1380 
atttgcagtg gagttccatg tttattgtcc 1440 
gagcaggaga gttaataacc tccccttctc 1500 
gattaac 1537 



<210> 33 
<211> 1866 
<212> DNA 

<213> Homo sapiens 



<400> 33 

ccgattcttg ctcactgctc *acccacctgc 
cttccttctg ggggtcctgg gggccctcac 
ccatctggta gagaagttgg gccagcacct 
gcacttgaac cccagcatct atgtgggcct 
ggaagacctc tacctgcaca gcctcaagct 
cttcagcgag gatgacggtg actgccaggg 
cctgctcgct ctcagagcca actgtgagtt 
ctcacagctc aaatggttcc tggaggatga 
ccacccccac actagctact accagtatgg 
gaagcgggtc catgacagcg tggtggacaa 
gggccaccat tctgtggaca cagcagccat 
ctcaaacttc aaccctggtc ggagacaacg 
ggagatcttg aaggcccaga cccccgaggg 
ggcattacag ttcctcatga cttcccccat 
caaggcgagg gttgctttgc tggccagtct 
gatttcccag ctgctgcccg ttctgaacca 
ctgtctggca ccacgagtca tgttggaacc 
gatcatcagt gtcacgctgc aggtgcttag 
tgttctggcc gggtccaccg tggaagatgt 
cacatatgaa acacaggcct cctcgtcagg 
ggccggagaa agggagttct ggcagcttct 
tattgctgac tacagaccca aggatggaga 
gcccctgagc tccctcatcc cagcagcctc 
tgatgtccct ggaacaggaa ctcgcctgac 
atgccccctg ggatcacccc agccacaagc 
tggagcagag agccaagcat cttccctggg 
gccctgcagg tctcccatga aggccacccc 
tccttggcaa aaaacggagt ccgcaggccg 
ttggggtcct gcaagaaggc ctcctcagcc 
actctgctgt tagagtggca gctctgagct 
gcagggctgc tcagtgcctg cctctgacaa 
aaaaaa 



tgctgccatg aggcaccttg gggccttcct 60 
tgagatgtgt gaaataccag agatggacag 120 
cttaccttgg atggaccggc tttccctgga 180 
acgcctctcc agtctgcagg ctgggaccaa 240 
tggttaccag cagtgcctcc tagggtctgc 300 
caagccttcc atgggccagc tggccctcta 360 
tgtcaggggc cacaaggggg acaggctggt 420 
gaagagagcc attgggcatg atcacaaggg 480 
cctgggcatt ctggccctgt gtctccacca 540 
acttctgtat gctgtggaac ctttccacca 600 
ggcaggcttg gcattcacct gtctgaagcg 660 
gatcaccatg gccatcagaa cagtgcgaga 720 
ccactttggg aatgtctaca gcaccccatt 780 
gcctggggca gaactgggaa cagcatgtct 840 
gcaggatgga gccttccaga atgctctcat 900 
caagacctac attgatctga tcttcccaga 960 
agctgctgag accattcctc agacccaaga 1020 
tctcttgccg ccgtacagac agtccatctc 1080 
cctgaagaag gcccatgagt taggaggatt 1140 
cccctactta acctccgtga tggggaaagc 1200 
ccgagacccc aacaccccac tgttgcaagg 1260 
aaccattgag ctgaggctgg ttagctggta 1320 
gcacactccc taggcttcta ccctccctcc 1380 
cctgctgcca cctcctgtgc actttgagca 1440 
ccttcgaggg ccctatacca tggcccacct 1500 
aagtctttct ggccaagtct ggccagcctg 1560 
atggtctgat gggcatgaag catctcagac 1620 
caggtgttgt gaagaccact cgttctgtgg 1680 
cgggggctat ggccctgacc ccagctctcc 1740 
ggttgtggca cagtagctgg ggagacctca 1800 
aattaaagca ttgatggcct gtggacctgc 1860 

1866 
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<210> 34 
<211> 2798 
<212> DNA 

<213> Homo sapiens 
<400> 34 

gccctctccc acagcggagt ccaaaacagg cctaccagtc agttcttatt tctattgggt 60 
gtttccatgc tccaccatgt taagagctaa gaatcagctt tttttacttt cacctcatta 120 
cctgaggcag gtaaaagaat catcaggctc caggctcata cagcaacgac ttctacacca 180 
gcaacagccc cttcacccag aatgggctgc cctggctaaa aagcagctga aaggcaaaaa 240 
cccagaagac ctaatatggc acaccccgga agggatctct ataaaaccct tgtattccaa 300 
gagagatact atggacttac ctgaagaact tccaggagtg aagccattca cacgtggacc 360 
atatcctacc atgtatacct ttaggccctg gaccatccgc cagtatgctg gttttagtac 420 
tgtggaagaa agcaataagt tctataagga caacattaag gctggtcagc agggattatc 480. 
agttgccttt gatctggcga cacatcgtgg ctatgattca gacaaccctc gagttcgtgg 540 
tgatgttgga atggctggag ttgctattga cactgtggaa gataccaaaa ttctttttga 600 
tggaattcct ttagaaaaaa tgtcagtttc catgactatg aatggagcag ttattccagt 660 
tcttgcaaat tttatagtaa ctggagaaga acaaggtgta cctaaagaga aacttactgg 72 0: 
taccatccaa aatgatatac taaaggaatt tatggttcga aatacataca tttttcctcc 7 80 
agaaccatcc atgaaaatta ttgctgacat atttgaatat acagcaaagc acatgccaaa 840 
atttaattca atttcaatta gtggatacca tatgcaggaa gcaggggctg atgccattct 900 
ggagctggcc tatactttag cagatggatt ggagtactct agaactggac tccaggctgg 960 
cctgacaatt gatgaatttg caccaaggtt gtctttcttc tggggaattg gaatgaattt 1020 
ctatatggaa atagcaaaga tgagagctgg tagaagactc tgggctcact taatagagaa 1080 
aatgtttcag cctaaaaact caaaatctct tcttctaaga gcacactgtc agacatctgg 1140 
atggtcactt actgagcagg atccctacaa taatattgtc cgtactgcaa tagaagcaat 1200 
ggcagcagta tttggaggga ctcagtcttt gcacacaaat tcttttgatg aagctttggg 1260 
tttgccaact gtgaaaagtg ctcgaattgc caggaacaca caaatcatca ttcaagaaga 1320 
atctgggatt cccaaagtgg ctgatccttg gggaggttct tacatgatgg aatgtctcac 1380 
aaatgatgtt, tatgatgctg ctttaaagct cattaatgaa attgaagaaa tgggtggaat 1440 
ggccaaagct gtagctgagg gaatacctaa acttcgaatt gaagaatgtg ctgcccgaag 1500 
acaagctaga atagattctg gttctgaagt aattgttgga gtaaataagt accagttgga 1560 
aaaagaagac gctgtagaag ttctggcaat tgataatact tcagtgcgaa acaggcagat 1620 
tgaaaaactt aagaagatca aatccagcag ggatcaagct ttggctgaac attgtcttgc 1680 
tgcactaacc gaatgtgctg ctagcggaga tggaaatatc ctggctcttg cagtggatgc. 1740 
atctcgggca agatgtacag tgggagaaat cacagatgcc ctgaaaaagg tatttggtga 1800 
acataaagcg aatgatcgaa tggtgagtgg agcatatcgc caggaatttg gagaaagtaa 1860 
agagataaca tctgctatca agagggttca taaattcatg gaacgtgaag gtcgcagacc 1920 
tcgtcttctt gtagcaaaaa tgggacaaga tggccatgac agaggagcaa aagttattgc 1980 
tacaggattt gctgatcttg gttttgatgt ggacataggc cctcttttcc agactcctcg 2040 
tgaagtggcc cagcaggctg tggatgcgga tgtgcatgct gtgggcgtaa gcaccctcgc 2100 
tgctggtcat aaaaccctag ttcctgaact catcaaagaa cttaactccc ttggacggcc 2160 
agatattctt gtcatgtgtg gaggggtgat accacctcag gattatgaat ttctgtttga 2220 
agttggtgtt tccaatgtat ttggtcctgg gactcgaatt ccaaaggctg ccgttcaggt 2280 
gcttgatgat attgagaagt gtttggaaaa gaagcagcaa tctgtataat atcctctttt 2340 
tgttttagct tttgtctaaa atattatttt agttatgatc aaagaagaga gtaaagctat 2400 
gtcttcaatt taatttcaat acctgatttg tactttcctt gaaagcttta ctttaaaata 24 60 
ccttacttat aggcctggtg tcatgctata agtatgtaca tacagtttca cttcaaaaat 2520 
aaaaaaaaat ccctaaaaac tctctatact ctctataaca atactttatc aagaactctg 2580 
gacaatggta ttatttttaa aaatcatggt gatgtattta ttagaatgtt tcttataaat 2640 
ctctttcatt tttatattaa gaattaaact gtacctaaaa aaactctgac tattcccatt 2700 
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<210> 34 

<211> 2798 

<212> DNA 

<213> Homo sapiens 

<400> 34 

gccctctccc acagcggagt ccaaaacagg cctaccagtc agttcttatt tctattgggt 60 
gtttccatgc tccaccatgt taagagctaa gaatcagctt tttttacttt cacctcatta 120 
cctgaggcag gtaaaagaat catcaggctc caggctcata cagcaacgac ttctacacca 180 
gcaacagccc cttcacccag aatgggctgc cctggctaaa aagcagctga aaggcaaaaa 240 
cccagaagac ctaatatggc acaccccgga agggatctct ataaaaccct tgtattccaa 300 
gagagatact atggacttac ctgaagaact tccaggagtg aagccattca cacgtggacc 360 
atatcctacc atgtatacct ttaggccctg gaccatccgc cagtatgctg gttttagtac 420 
tgtggaagaa agcaataagt tctataagga caacattaag gctggtcagc agggattatc 480 
agttgccttt gatctggcga cacatcgtgg ctatgattca gacaaccctc gagttcgtgg 540 
tgatgttgga atggctggag ttgctattga cactgtggaa gataccaaaa ttctttttga 600 
tggaattcct ttagaaaaaa tgtcagtttc catgactatg aatggagcag ttattccagt 660 
tcttgcaaat tttatagtaa ctggagaaga acaaggtgta cctaaagaga aacttactgg 720 
taccatccaa aatgatatac taaaggaatt tatggttcga aatacataca tttttcctcc 780 
agaaccatcc atgaaaatta ttgctgacat atttgaatat acagcaaagc acatgccaaa 840 
atttaattca atttcaatta gtggatacca tatgcaggaa gcaggggctg atgccattct 900 
ggagctggcc tatactttag cagatggatt ggagtactct agaactggac tccaggctgg 960 
cctgacaatt gatgaatttg caccaaggtt gtctttcttc tggggaattg gaatgaattt 1020 
ctatatggaa atagcaaaga tgagagctgg tagaagactc tgggctcact taatagagaa 1080 
aatgtttcag cctaaaaact caaaatctct tcttctaaga gcacactgtc- agacatctgg 1140 
atggtcactt actgagcagg atccctacaa taatattgtc cgtactgcaa tagaagcaat 1200 
ggcagcagta tttggaggga ctcagtcttt gcacacaaat . tcttttgatg aagctttggg 1260 
tttgccaact gtgaaaagtg ctcgaattgc caggaacaca caaatcatca ttcaagaaga 1320 
atctgggatt cccaaagtgg ctgatccttg gggaggttct tacatgatgg aatgtctcac 13 80 
aaatgatgtt tatgatgctg ctttaaagct cattaatgaa attgaagaaa tgggtggaat 1440 
ggccaaagct gtagctgagg gaatacctaa acttcgaatt gaagaatgtg ctgcccgaag 1500 
acaagctaga atagattctg gttctgaagt aattgttgga gtaaataagt accagttgga 1560 
aaaagaagac gctgtagaag ttctggcaat tgataatact tcagtgcgaa acaggcagat 1620 
tgaaaaactt aagaagatca aatccagcag ggatcaagct ttggctgaac attgtcttgc 1680 
tgcactaacc gaatgtgctg ctagcggaga tggaaatatc ctggctcttg cagtggatgc 1740 
atctcgggca agatgtacag tgggagaaat cacagatgcc ctgaaaaagg tatttggtga 1800 
acataaagcg aatgatcgaa tggtgagtgg agcatatcgc caggaatttg gagaaagtaa 1860 
agagataaca tctgctatca agagggttca taaattcatg gaacgtgaag gtcgcagacc 1920 
tcgtcttctt gtagcaaaaa tgggacaaga tggccatgac agaggagcaa aagttattgc 1980 
tacaggattt gctgatcttg gttttgatgt ggacataggc cctcttttcc agactcctcg 2040 
tgaagtggcc cagcaggctg tggatgcgga tgtgcatgct gtgggcgtaa gcaccctcgc 2100 
tgctggtcat aaaaccctag ttcctgaact catcaaagaa cttaactccc ttggacggcc 2160 
agatattctt gtcatgtgtg gaggggtgat accacctcag gattatgaat ttctgtttga 2220 
agttggtgtt tccaatgtat ttggtcctgg gactcgaatt ccaaaggctg ccgttcaggt 2280 
gcttgatgat attgagaagt gtttggaaaa gaagcagcaa tctgtataat atcctctttt 2340 
tgttttagct tttgtctaaa atattatttt agttatgatc aaagaagaga gtaaagctat 2400* 
gtcttcaatt taatttcaat acctgatttg tactttcctt gaaagcttta ctttaaaata 2460 
ccttacttat aggcctggtg tcatgctata agtatgtaca tacagtttca cttcaaaaat 2520 
aaaaaaaaat ccctaaaaac tctctatact ctctataaca atactttatc aagaactctg 2580 
gacaatggta ttatttttaa aaatcatggt gatgtattta ttagaatgtt tcttataaat 2640 
ctctttcatt tttatattaa gaattaaact gtacctaaaa aaactctgac tattcccatt 2700 
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tctcagttta gcattacatt gtcttgagca ccagaaaata aaatccatat attaattaaa 27 60 
acctatcttg aaaaaaaaaa aaaaaaaaaa aaaaaaaa 2798 

<210> 35 
<211> 1637 
<212> DNA 

<213> Homo sapiens 



<400> 35 

aagaactggc 

ccaaacactc 

tcttgagctg 

ggatgagagg 

cttttatatt 

tgaaaatgcc 

atatcttgaa 

gaagcgtcct 

agccaatgag 

gttatcattt 

cccttctgat 

aagtatgcgg 

tgaagtaatt 

ttacactgga 

ttatgttggc 

gggagttgat 

tgctggtgcc 

gtttggccat 

ggtctgtgga 

tttagagatc 

tggctatctg 

accagttgtg 

aacattttct 

tgacaagcgg 

tgatgtttat 

ttagcagtgt 

agtattattc 

taaataatat 



ctgtacattt 

cattgggatc 

ccggctgaca 

gtggctctcc 

cccaaaatac 

atctatttct 

gaagaactag 

tggattacag 

aaagaaatag 

tttaagccta 

cattatgcta 

atgataaagc 

gagaaggaag 

cagcacttta 

tttgatctag 

tttgcctgct 

ttcattcatg 

gaactcagca 

ttccgaattt 

tttaagcaag 

gaatacctga 

aacataatta 

gttccaaaca 

aatccaaatg 

aaatttacca 

tttctagaac 

gatttttaat 

accttac 



tcaaggaatt 

ctagctgttt 

cagtgcagcg 

acctagatga 

aggatctgcc 

tgggaaattc 

ataagtgggc 

gagatgagag 

ccctaatgaa 

cgccaaaacg 

ttgagtcaca 

caagagaggg 

gagactcaat 

atattcctgc 

cacatgcagt 

ggtgttccta 

aaaagcatgc 

ccagatttaa 

caaatcctcc 

cgacaatgaa 

tcaagcataa 

ctccgtctca 

aagatgtttt 

gcattcgagt 

atctgctcac 

aacttaagca 

tattgaaagt 



cttgagaggt 

tagagaacaa 

cattgcggct 

ggaagataag 

tccagttgat 

tcttggcctt 

caaaatagca 

tattgtaggc 

tgctttgact 

atataaaatt 

actacaactt 

ggaagaaacc 

tgcagtgatc 

catcacaaaa 

tggaaatgtt 

caagtattta 

ccatacgatt 

gatggataac 

cattttgttg 

ggcattgcgg 

ctatggcaaa 

tgtagaggag 

ccaagaacta 

ggctccagtt 

ttctatactt 

aattatactg 

atgtcaccat 



tcttggagag 
cttgtaatgg 
gaactcaaat 
ctgaggcact 
ttatcattag 
caaccaaaaa 
gcctatggtc 
cttatgaagg 
gtaaatttac 
cttctagaag 
cacggactta 
ttaagaatag 
ctgttcagtg 
gctggacaag 
gaactctact 
aatgcaggag 
aaacctgcat 
aaactgcagt 
gtctgttcct 
aaaaaatctg 
gataaagcag 
cgggggtgcc 
gaaaaaagag 
cctctctata 
gactctgcag 
aaagctgctg 
tgaccacatg 



attctgggag 
agccttcatc 
gccacccaac 
tcagggagtg 
tgaataaaga 
tggttaaaac 
atgaagtggg 
acattgtagg 
atcttctaat 
ccaaagcctt 
acattgaaga 
aggatatcct 
gggtgcattt 
cgaagggttg 
tacatgactg 
caggaggaat 
tagtgggatg 
taatccctgg 
tgcatgctag 
ttttgctaac 
caaccaagaa 
agctaacaat 
gagtggtttg 
attctttcca 
aaacaaaaaa 
tggttatttc 
taactaacaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1637 



<210> 36 
<211> 1908 
<212> DNA 

<213> Homo sapiens 



<400> 36 

gaattcatga 

gagaaatata 

cacggtaggt 

ctgtgcaggt 

tttaaaaaca 

ttagtcacat 

taatctaaac 



aaacgtagct 
tacgaaagga 
ggctaaacac 
gacccaagtg 
aacaaacgta 
ctataccatc 
tacaaaaagt 



cgtcctcaaa 
acaagatttt 
cagtcttcaa 
aggggtcacc 
cttattgcgt 
ctaagaaact 
gttcactggg 



aaaaacagaa 
gaagcaccca 
tgcccttcca 
cttgggcttt 
tgaaggacgg 
ttatccaccc 
gaactgcaca 



gaggagtaat 
agctgccacc 
cagcctcagt 
tcctgtggca 
caacaggaag 
aaactgtatt 
atatgactgc 



cattttaagg 60 
tacattaaaa 120 
ctgaaaaata 180 
gtatctctgg 240 
gactccatga 300 
tcagacttta 360 
ttttaaccgt 420 
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tctcagttta gcattacatt gtcttgagca ccagaaaata aaatccatat attaattaaa 2760 
acctatcttg aaaaaaaaaa aaaaaaaaaa aaaaaaaa 2798 

<210> 35 

<211> 1637 

<212> DNA 

<213> Homo sapiens 



<400> 35 

aagaactggc 

ccaaacactc 

tcttgagctg 

ggatgagagg 

cttttatatt 

tgaaaatgcc 

atatcttgaa 

gaagcgtcct 

agccaatgag 

gttatcattt 

cccttctgat 

aagtatgcgg 

tgaagtaatt 

ttacactgga 

ttatgttggc 

gggagttgat 

tgctggtgcc 

gtttggccat 

ggtctgtgga 

tttagagatc 

tggctatctg 

accagttgtg 

aacattttct 

tgacaagcgg 

tgatgtttat 

ttagcagtgt 

agtattattc 

taaataatat 



ctgtacattt 

cattgggatc 

ccggctgaca 

gtggctctcc 

cccaaaatac 

atctatttct 

gaagaactag 

tggattacag 

aaagaaatag 

tttaagccta 

cattatgcta 

atgataaagc. 

gagaaggaag 

cagcacttta 

tttgatctag 

tttgcctgct 

ttcattcatg 

gaactcagca 

ttccgaattt 

tttaagcaag 

gaatacctga 

aacataatta 

gttccaaaca 

aatccaaatg 

aaatttacca 

tttctagaac 

gatttttaat 

accttac 



tcaaggaatt 

ctagctgttt 

cagtgcagcg 

acctagatga 

aggatctgcc 

tgggaaattc 

ataagtgggc 

gagatgagag 

ccctaatgaa 

cgccaaaacg 

ttgagtcaca 

caagagaggg 

gagactcaat 

atattcctgc 

cacatgeagt 

ggtgttccta 

aaaagcatgc 

ccagatttaa 

caaatcctcc 

cgacaatgaa 

tcaagcataa 

ctccgtctca 

aagatgtttt 

gcattcgagt 

atctgctcac 

aacttaagca 

tattgaaagt 



cttgagaggt 
tagagaacaa 
cattgcggct 
ggaagataag 
tccagttgat 
tcttggcctt 
caaaatagca 
tattgtaggc 
tgctttgact 
atataaaatt 
actacaactt 
ggaagaaacc 
tgcagtgatc 
catcacaaaa 
tggaaatgtt 
caagtattta 
ccatacgatt 
gatggataac 
cattttgttg 
ggcattgcgg 
ctatggcaaa 
tgtagaggag 
ccaagaacta 
ggctccagtt 
ttctatactt 
aattatactg 
atgtcaccat 



tcttggagag 
cttgtaatgg 
gaactcaaat 
ctgaggcact 
ttatcattag 
caaccaaaaa 
gcctatggtc 
cttatgaagg 
gtaaatttac 
cttctagaag 
cacggactta 
ttaagaatag 
ctgttcagtg 
gctggacaag 
gaactctact 
aatgcaggag 
aaacctgcat 
aaactgcagt 
gtctgttcct 
aaaaaatctg 
gataaagcag 
cgggggtgcc 
gaaaaaagag 
cctctctata 
gactctgcag 
aaagctgctg 
tgaccacatg 



attctgggag 
agccttcatc 
gccacccaac 
tcagggagtg 
tgaataaaga 
tggttaaaac 
atgaagtggg 
acattgtagg 
atcttctaat 
ccaaagcctt 
acattgaaga 
aggatatcct 
gggtgcattt 
cgaagggttg 
tacatgactg 
caggaggaat 
tagtgggatg 
taatccctgg 
tgcatgctag 
ttttgctaac 
caaccaagaa 
agctaacaat 
gagtggtttg 
attctttcca 
aaacaaaaaa 
tggttatttc 
taactaacaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1637 



<210> 36 

<211> 1908 

<212> DNA 

<213> Homo sapiens 



<400> 36 

gaattcatga 

gagaaatata 

cacggtaggt 

ctgtgcaggt 

tttaaaaaca 

ttagtcacat 

taatctaaac 



aaacgtagct 
tacgaaagga 
ggctaaacac 
gacccaagtg 
aacaaacgta 
ctataccatc 
tacaaaaagt 



cgtcctcaaa 
acaagatttt 
cagtcttcaa 

aggggtcacc 

cttattgcgt 
ctaagaaact 
gttcactggg 



aaaaacagaa 
gaagcaccca 
tgcccttcca 
cttgggcttt 
tgaaggacgg 
ttatccaccc 
gaactgcaca 



gaggagtaat 
agctgccacc 
cagcctcagt 
tcctgtggca 
caacaggaag 
aaactgtatt 
atatgactgc 



cattttaagg 60 
tacattaaaa 120 
ctgaaaaata 180 
gtatctctgg 240 
gactccatga 300 
tcagacttta 360 
ttttaaccgt 420 
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agtgatttca aatattgagc catgctgttg cagtcttaaa aactggagac ctaagggcag 480 
ctttcttcta gtcacccaat ccagcacttt tttaaaaaat cagtaaaact cttcgaccac 540 
caaggaaaaa aaaaaaggat ggaggttaaa agacgcaccc cttgcccaca agccccctca 600 
tcagaatggg agtcaggaga cctgagttcc tgtctcaggc ctgccattaa aaacctgcat 660 
aacctttgcc tatctcctca aacggaagta ctaaaacctc agcgcttcac ccaatttgta 720 
gccccggctg ggctcttccc accttcccct tcttcagccc gccccttcct cctccagccc 780 
tatcatcggg cggagggtcc ccgcctccgc ccgccttacc cacaagcccc gcccccccag 840 
ccccgatggc cctgcccagt cccagacaga acctactacg- tgcggcggca gctggggcgg 900 
gaaggcgggc gctgggggcg* ctgcggccgc tgcagcgcag ggtccacctg gtcggctgca 960 
cctgtggagg aggaggtgga tttcaggctt cccgtagact ggaagaatcg gctcaaaacc 1020 
gcttgcctcg caggggctga gctggaggca gcgaggccgc ccgacgcagg cttccggcga 1080 
gacatggcag ggcaaggatg gcagcccggc ggcagggccc ggcgaggagc gcgaacccgc 1140 
ggccgcagtt cccaggcgtc tgcgggcgcg agcacgccgc gaccctgcgt gcgccggggc 1200 
gggggggcgg ggcctcgcct gcacaaatag ggacgagggg gcggggcggc cacaatttcg 1260 
cgccaaactt gaccgcgcgt tctgctgtaa cgagcgggct cggaggtcct cccgctgctg 1320 
tcatggttgg ttcgctaaac tgcatcgtcg ctgtgtccca gaacatgggc atcggcaaga 1380 
acggggacct gccctggcca ccgctcaggt atctgccggg ccggggcgat gggacccaaa 1440 
cgggcgcagg ctgcccacgg tcggggtacc tgggcgggac gcgccggccg actcccggcg 1500 
agaggatggg gccagacttg cggtctgcgc tggcaggaag ggtgggcccg actggattcc 1560 
ccttttctgc tgcgcgggag gcccagttgc tgatttctgc ccggattctg ctgcccggtg. 1620 
aggtcttgcc ctgcggcgcc ctcgcccagg gcaaagtccc agccctggag aaaacacctc 1680 
acccctaccc acagcgctcc gtttgtcagg tgccttagag ctcgagccca agggataatg 1740 
tttcgagtaa cgctgtttct ctaacttgta ggaatgaatt cagatatttc cagagaatga 1800 
ccacaacctc ttcagtagaa ggtaatgtgg gattaagtag ggtcttgctt gatgaagttt 1860 
accagtgcaa atgttagtta aatggaaagt tttccgtgtt aatctggg 1908 

<210> 37 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 



<400> 37 

cccacggtcg gggtggccga ctcccggcga 

<210> 38 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 38 

ctaaactgca tcgtcgctgt g 21 



<210> 39 
<211> 19 
<212> DNA 
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agtgatttca aatattgagc catgctgttg cagtcttaaa aactggagac 
ctttcttcta gtcacccaat ccagcacttt tttaaaaaat cagtaaaact 
caaggaaaaa aaaaaaggat ggaggttaaa agacgcaccc cttgcccaca 
tcagaatggg agtcaggaga cctgagttcc tgtctcaggc ctgccattaa 
aacctttgcc tatctcctca aacggaagta ctaaaacctc agcgcttcac 
gccccggctg ggctcttccc accttcccct tcttcagccc gccccttcct 
tatcatcggg cggagggtcc ccgcctccgc ccgccttacc cacaagcccc 
ccccgatggc cctgcccagt cccagacaga acctactacg tgcggcggca 
gaaggcgggc gctgggggcg ctgcggccgc tgcagcgcag ggtccacctg 
cctgtggagg aggaggtgga tttcaggctt cccgtagact ggaagaatcg 
gcttgcctcg caggggctga gctggaggca gcgaggccgc ccgacgcagg 
gacatggcag ggcaaggatg gcagcccggc ggcagggccc ggcgaggagc 
ggccgcagtt cccaggcgtc tgcgggcgcg agcacgccgc gaccctgcgt 
gggggggcgg ggcctcgcct gcacaaatag ggacgagggg gcggggcggc 
cgccaaactt gaccgcgcgt tctgctgtaa cgagcgggct cggaggtcct 
tcatggttgg ttcgctaaac tgcatcgtcg ctgtgtccca gaacatgggc 
acggggacct gccctggcca ccgctcaggt atctgccggg ccggggcgat 
cgggcgcagg ctgcccacgg tcggggtacc tgggcgggac gcgccggccg 
agaggatggg gccagacttg cggtctgcgc tggcaggaag ggtgggcccg 
ccttttctgc tgcgcgggag gcccagttgc tgatttctgc ccggattctg 
aggtcttgcc ctgcggcgcc ctcgcccagg gcaaagtccc agccctggag 
acccctaccc acagcgctcc gtttgtcagg tgccttagag ctcgagccca 
tttcgagtaa cgctgtttct ctaacttgta ggaatgaatt cagatatttc 
ccacaacctc ttcagtagaa ggtaatgtgg gattaagtag ggtcttgctt 
accagtgcaa atgttagtta aatggaaagt tttccgtgtt aatctggg 

<210> 37 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 37 

cccacggtcg gggtggccga ctcccggcga r 30 

<210> 38 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 38 

ctaaactgca tcgtcgctgt g 21 

<210> 39 
<211> 19 
<212> DNA 



ctaagggcag 480 
cttcgaccac 540 
agccccctca 600 
aaacctgcat 660 
ccaatttgta 720 
cctccagccc 780 
gcccccccag 840 
gctggggcgg 900 
gtcggctgca 960 
gctcaaaacc 1020 
cttccggcga 1080 
gcgaacccgc 1140 
gcgccggggc 1200 
cacaatttcg 1260 
cccgctgctg 1320 
atcggcaaga 1380 
gggacccaaa- 1440 
actcccggcg 1500 
actggattcc 1560 
ctgcccggtg 1620 
aaaacacctc 1680 
agggataatg 1740 
cagagaatga 1800 
gatgaagttt 1860 
1908 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 39 

aaaaggggaa tccagtcgg 19 

<210> 40 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR product 
<400> 40 

acctgggcgg gacgcgcca 19 

<210> 41 

<211> 1275 

<212> DNA 

<213> Homo sapiens 

<400> 41 

ctgcagcgcc agggtccacc tggtcggctg cacctgtgga ggaggaggtg gatttcaggc 60 
ttcccgtaga ctggaagaat cggctcaaaa ccgcttgcct cgcaggggct gagctggagg 120 
cagcgaggcc gcccgacgca ggcttccggc gagacatggc agggcaagga tggcagcccg 180 
gcggcagggc ccggcgagga gcgcgaaccc gcggccgcag ttcccaggcg tctgcgggcg 240 
cgagcacgcc gcgaccctgc gtgcgccggg gcgggggggc ggggcctcgc ctgcacaaat 300 
agggacgagg gggcggggcg gccacaattt cgcgccaaac ttgaccgcgc gttctgctgt 360 
aacgagcggg ctcggaggtc ctcccgctgc tgtcatggtt ggttcgctaa actgcatcgt 420 
cgctgtgtcc cagaacatgg gcatcggcaa gaacggggac ctgccctggc caccgctcag 480 
gtatctgccg ggccggggcg atgggaccca aacgggcgca ggctgcccac ggtcggggta 540 
cctgggcggg acgcgccagg ccgactcccg gcgagaggat ggggccagac ttgcggtctg 600 
cgctggcagg aagggtgggc ccgactggat tccccttttc tgctgcgcgg gaggcccagt 660 
tgctgatttc tgcccggatt ctgctgcccg gtgaggtctt tgccctgcgg cgccctcgcc 720 
cagggcaaag tcccagccct ggagaaaaca cctcacccct acccacagcg ctccgtttgt 780 
caggtgcctt agagctcgag cccaagggat aatgtttcga gtaacgctgt ttctctaact 840 
tgtaggaatg aattcagata tttccagaga atgaccacaa cctcttcagt agaaggtaat 900 
gtgggattaa gtagggtctt gcttgatgaa gtttaccagt gcaaatgtta gttaaatgga 960 
aagttttccg tgttaatctg ggaccttttc tcttattatg gatctgtatg atctgtatgc 1020 
agttcccaag gttcatttac cattattaaa aaatttttgt cttagaaatt ttatgtatgt 1080 
caacgcacga gcaaattatc aggcatgggg cagaattggc aactgggtgg aggcttcggt 1140 
ggaggttagc actccgaaag gaaaacagag taggcctttg gaacagctgc tggaagagat 1200 
aaggcctgaa caagggcagt ggagaagaga gggtaaaaat tttttaaggt tacatgaccc 1260 
tggattttgg agate 127 5 

<210> 42 
<211> 1256 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 39 

aaaaggggaa tccagtcgg 

<210> 40 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR product 



<400> 40 

acctgggcgg gacgcgcca 19 

<210> 41 

<211> 1275 

<212> DNA 

<213> Homo sapiens 

<400> 41 

ctgcagcgcc agggtccacc tggtcggctg cacctgtgga ggaggaggtg gatttcaggc 60 
ttcccgtaga ctggaagaat cggctcaaaa ccgcttgcct cgcaggggct gagctggagg 120 
cagcgaggcc gcccgacgca ggcttccggc gagacatggc agggcaagga tggcagcccg 180 
gcggcagggc ccggcgagga gcgcgaaccc gcggccgcag ttcccaggcg tctgcgggcg 240 
cgagcacgcc gcgaccctgc gtgcgccggg gcgggggggc ggggcctcgc ctgcacaaat 300 
agggacgagg gggcggggcg gccacaattt cgcgccaaac ttgaccgcgc gttctgctgt 360 
aacgagcggg ctcggaggtc ctcccgctgc tgtcatggtt ggttcgctaa actgcatcgt 420 
cgctgtgtcc cagaacatgg gcatcggcaa gaacggggac ctgccctggc caccgctcag 480 
gtatctgccg ggccggggcg atgggaccca aacgggcgca ggctgcccac ggtcggggta 540 
cctgggcggg acgcgccagg ccgactcccg gcgagaggat ggggccagac ttgcggtctg 600 
cgctggcagg aagggtgggc ccgactggat tccccttttc tgctgcgcgg gaggcccagt 660 
tgctgatttc tgcccggatt ctgctgcccg gtgaggtctt tgccctgcgg cgccctcgcc 720 
cagggcaaag tcccagccct ggagaaaaca cctcacccct acccacagcg ctccgtttgt 780 
caggtgcctt agagctcgag cccaagggat aatgtttcga gtaacgctgt ttctctaact 840 
tgtaggaatg aattcagata tttccagaga atgaccacaa cctcttcagt agaaggtaat 900 
gtgggattaa gtagggtctt gcttgatgaa gtttaccagt gcaaatgtta gttaaatgga 960 
aagttttccg tgttaatctg ggaccttttc tcttattatg gatctgtatg atctgtatgc 1020 
agttcccaag gttcatttac cattattaaa aaatttttgt cttagaaatt ttatgtatgt 1080 
caacgcacga gcaaattatc aggcatgggg cagaattggc aactgggtgg aggcttcggt 1140 
ggaggttagc actccgaaag gaaaacagag taggcctttg gaacagctgc tggaagagat 1200 
aaggcctgaa caagggcagt ggagaagaga gggtaaaaat tttttaaggt tacatgaccc 1260 
tggattttgg agate 1275 

<210> 42 
<211> 1256 
<212> DNA 
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<213> Homo sapiens 
<400> 42 

ctgcagcgcc agggtccacc tggtcggctg cacctgtgga ggaggaggtg gatttcaggc 60 

ttcccgtaga ctggaagaat cggctcaaaa ccgcttgcct cgcaggggct gagctggagg 120 

cagcgaggcc gcccgacgca ggcttccggc gagacatggc agggcaagga tggcagcccg 180 

gcggcagggc ccggcgagga gcgcgaaccc gcggccgcag ttcccaggcg tctgcgggcg 240 

cgagcacgcc gcgaccctgc gtgcgccggg gcgggggggc ggggcctcgc ctgcacaaat 300 

agggacgagg gggcggggcg gccacaattt cgcgccaaac ttgaccgcgc gttctgctgt 360 

aacgagcggg ctcggaggtc ctcccgctgc tgtcatggtt ggttcgctaa actgcatcgt 420 

cgctgtgtcc cagaacatgg gcatcggcaa gaacggggac ctgccctggc caccgctcag 480 

gtatctgccg ggccggggcg atgggaccca aacgggcgca ggctgcccac ggtcggggtg 540 

gccgactccc ggcgagagga tggggccaga cttgcggtct gcgctggcag gaagggtggg 600 

cccgactgga ttcccctttt ctgctgcgcg ggaggcccag ttgctgattt ctgcccggat 660 

tctgctgccc ggtgaggtct ttgccctgcg gcgccctcgc ccagggcaaa gtcccagccc 720 

tggagaaaac acctcacccc tacccacagc gctccgtttg tcaggtgcct tagagctcga 780 

gcccaaggga taatgtttcg agtaacgctg tttctctaac ttgtaggaat gaattcagat 840 

atttccagag aatgaccaca acctcttcag tagaaggtaa tgtgggatta agtagggtct 900 

tgcttgatga agtttaccag tgcaaatgtt agttaaatgg aaagttttcc gtgttaatct 960 

gggacctttt ctcttattat ggatctgtat gatctgtatg cagttcccaa ggttcattta 1020 

ccattattaa aaaatttttg tcttagaaat tttatgtatg tcaacgcacg agcaaattat 1080 

caggcatggg gcagaattgg caactgggtg gaggcttcgg tggaggttag cactccgaaa 1140 

ggaaaacaga gtaggccttt ggaacagctg ctggaagaga taaggcctga acaagggcag 1200 

tggagaagag agggtaaaaa . ttttttaagg ttacatgacc ctggattttg gagatc * 1256 

<210> 43 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR product 
<400> 43 

gctgcccacg gtcggggtac ctgggcggga cgcgccaggc cgactcccgg cgaga 55 

<210> 44 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR product 
<400> 44 

gctgcccacg gtcggggtgg ccgactcccg gcgaga 36 

<210> 45 

<211> 1273 

<212> DNA 

<213> Homo sapiens 
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<213> Homo sapiens 
<400> 42 

ctgcagcgcc agggtccacc tggtcggctg cacctgtgga ggaggaggtg gatttcaggc 60 
ttcccgtaga ctggaagaat cggctcaaaa ccgcttgcct cgcaggggct gagctggagg 120 
cagcgaggcc gcccgacgca ggcttccggc gagacatggc agggcaagga tggcagcccg 180 
gcggcagggc ccggcgagga gcgcgaaccc gcggccgcag ttcccaggcg tctgcgggcg 240 
cgagcacgcc' gcgaccctgc gtgcgccggg gcgggggggc ggggcctcgc ctgcacaaat 300 
agggacgagg gggcggggcg gccacaattt cgcgccaaac ttgaccgcgc gttctgctgt 360 
aacgagcggg ctcggaggtc ctcccgctgc tgtcatggtt ggttcgctaa actgcatcgt 420 
cgctgtgtcc cagaacatgg gcatcggcaa gaacggggac ctgccctggc caccgctcag 480 
gtatctgccg ggccggggcg atgggaccca aacgggcgca ggctgcccac ggtcggggtg. 540 
gccgactccc ggcgagagga tggggccaga cttgcggtct gcgctggcag gaagggtggg 600 
cccgactgga ttcccctttt ctgctgcgcg ggaggcccag ttgctgattt ctgcccggat 660 
tctgctgccc ggtgaggtct ttgccctgcg gcgccctcgc ccagggcaaa gtcccagccc 720 
tggagaaaac acctcacccc tacccacagc gctccgtttg tcaggtgcct tagagctcga 780 
gcccaaggga taatgtttcg agtaacgctg tttctctaac ttgtaggaat gaattcagat 840 
atttccagag aatgaccaca acctcttcag tagaaggtaa tgtgggatta agtagggtct 900 
tgcttgatga agtttaccag tgcaaatgtt agttaaatgg aaagttttcc gtgttaatct 960 
gggacctttt ctcttattat ggatctgtat gatctgtatg cagttcccaa ggttcattta' 1020 
ccattattaa aaaatttttg tcttagaaat tttatgtatg tcaacgcacg agcaaattat 1080 
caggcatggg gcagaattgg caactgggtg gaggcttcgg tggaggttag cactccgaaa- 1140 
ggaaaacaga gtaggccttt ggaacagctg ctggaagaga taaggcctga acaagggcag 1200 
tggagaagag agggtaaaaa ttttttaagg ttacatgacc ctggattttg gagatc 1256 

<210> 43 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR product 
<400> 43 

gctgcccacg gtcggggtac ctgggcggga cgcgccaggc cgactcccgg cgaga 55 

<210> 44 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR product 
<400> 44 

gctgcccacg gtcggggtgg ccgactcccg gcgaga 36 

<210> 45 

<211> 1273 

<212> DNA 

<213> Homo sapiens 
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<400> 45 

ctgcagcgca 

tcccgtagac 

agcgaggccg 

cggcagggcc 

gagcacgccg 

gggacgaggg 

acgagcgggc 

gctgtgtccc 

tatctgccgg 

ctgggcggga 

ctggcaggaa 

ctgatttctg 

gggcaaagtc 

ggtgccttag 

taggaatgaa 

gggattaagt 

gttttccgtg 

ttcccaaggt 

acgcacgagc 

aggttagcac 

ggcctgaaca 

gattttggag 



gggtccacct 
tggaagaatc 
cccgacgcag 
cggcgaggag 
cgaccctgcg 
ggcggggcgg 
tcggaggtcc 
agaacatggg 
gccggggcga 
cgcgccggcc 
gggtgggccc 
cccggattct 
ccagccctgg 
agctcgagcc 
ttcagatatt 
agggtcttgc 
ttaatctggg 
tcatttacca 
aaattatcag 
tccgaaagga 
agggcagtgg 
ate 



ggtcggctgc 
ggctcaaaac 
gcttccggcg 
cgcgaacccg 
tgcgccgggg 
ccacaatttc 
tcccgctgct 
categgcaag 
tgggacccaa 
gactcccggc 
gactggattc 
gctgcccggt 
agaaaacacc 
caagggataa 
tccagagaat 
ttgatgaagt 
accttttctc 
ttattaaaaa 
gcatggggca 
aaacagagta 
agaagagagg 



acctgtggag 
cgcttgcctc 
agacatggca 
cggccgcagt 
egggggggeg 
gcgccaaact 
gtcatggttg 
aaeggggace 
aegggegcag 
gagaggatgg 
cccttttctg 
gaggtctttg 
tcacccctac 
tgtttcgagt 
gaccacaacc 
ttaccagtgc 
ttattatgga 
atttttgtct 
gaattggcaa 
ggcctttgga 
gtaaaaattt 



gaggaggtgg 
gcaggggctg 
gggcaaggat 
tcccaggcgt 
gggcctcgcc 
tgaccgcgcg 
gttcgctaaa 
tgccctggcc 
gctgcccacg 
ggecagaett 
ctgcgcggga 
ccctgcggcg 
ccacagcgct 
aacgctgttt 
tcttcagtag 
aaatgttagt 
tctgtatgat 
tagaaatttt 
ctgggtggag 
acagctgetg 
tttaaggtta 



atttcaggct 
agctggaggc 
ggcagcccgg 
ctgcgggcgc 
tgcacaaata 
ttctgctgta 
ctgcatcgtc 
accgctcagg 
gteggggtae 
gcggtctgcg 
ggcccagttg 
ccctcgccca 
ccgtttgtca 
ctctaacttg 
aaggtaatgt 
taaatggaaa 
ctgtatgcag 
atgtatgtca 
gcttcggtgg 
gaagagataa 
catgaccctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

12 60 

1273 



<210> 46 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :PCR product 



<400> 46 

acctgggegg gacgcgcc 



18 
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<400> 45 

ctgcagcgca gggtccacct ggtcggctgc 
tcccgtagac tggaagaatc ggctcaaaac 
agcgaggccg cccgacgcag gcttccggcg 
cggcagggcc cggcgaggag cgcgaacccg 
gagcacgccg cgaccctgcg tgcgccgggg 
gggacgaggg ggcggggcgg ccacaatttc 
acgagcgggc tcggaggtcc tcccgctgct 
gctgtgtccc agaacatggg catcggcaag 
tatctgccgg gccggggcga tgggacccaa 
ctgggcggga cgcgccggcc gactcccggc 
ctggcaggaa gggtgggccc gactggattc 
ctgatttctg cccggattct gctgcccggt 
gggcaaagtc ccagccctgg agaaaacacc 
ggtgccttag agctcgagcc caagggataa 
taggaatgaa ttcagatatt tccagagaat 
gggattaagt agggtcttgc ttgatgaagt 
gttttccgtg ttaatctggg accttttctc 
ttcccaaggt tcatttacca ttattaaaaa 
acgcacgagc aaattatcag gcatggggca 
aggttagcac tccgaaagga aaacagagta 
ggcctgaaca agggcagtgg agaagagagg 
gattttggag ate 



acctgtggag gaggaggtgg atttcaggct 60 
cgcttgcctc gcaggggctg agctggaggc 120 
agacatggca gggcaaggat ggcagcccgg 180 
cggccgcagt tcccaggcgt ctgcgggcgc 240 
egggggggeg gggcctcgcc tgcacaaata 300 
gcgccaaact tgaccgcgcg ttctgctgta 360 
gtcatggttg gttcgctaaa ctgcatcgtc 420 
aaeggggace tgccctggcc accgctcagg 480 
aegggegcag gctgcccacg gteggggtae 540 
gagaggatgg ggecagaett gcggtctgcg 600 , 
cccttttctg ctgcgcggga ggcccagttg 660 
gaggtctttg ccctgcggcg ccctcgccca 720 
tcacccctac ccacagcgct ccgtttgtca 780 
tgtttcgagt aacgctgttt ctctaacttg 840 
gaccacaacc tcttcagtag aaggtaatgt 900 
ttaccagtgc aaatgttagt taaatggaaa 960 
ttattatgga fcctgtatgat ctgtatgcag 1020 
atttttgtct tagaaatttt atgtatgtca 1080 
gaattggcaa ctgggtggag gcttcggtgg 1140^ 
ggcctttgga acagctgetg gaagagataa 1200 
gtaaaaattt tttaaggtta catgaccctg 12 60' 

1273 



<210> 46 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR product - 
<400> 46 

acctgggegg gacgcgcc 18 



43 



